From the INTERNATIONAL BUREAU



PCT 

NOTIFICATION OF ELECTION 

(PCT Rule 61.2) 



To: 



Assistant Commissioner for Patents 
United States Patent and Trademark 
Office 
Box PCT 

Washington, D.C. 20231 
ETATS-UNIS D'AMERIQUE 



Date of mailing (day/month/year) 

24 October 2000 (24.10.00) 


in its capacity as elected Office 




International application No. 

PCT/US00/01338 


Applicant's or agent's file reference 

3260.93-304 


International filing date (day/month/year) 
21 January 2000 (21.01.00) 


Priority date (day/month/year) 

21 January 1999 (21.01.99) 


Applicant 

CERRETTI, Douglas, Pat 






1. The designated Office is hereby notified of its election made: 

[X] in the demand filed with the International Preliminary Examining Authority on:

18 August 2000 (18.08.00)

[ ] in a notice effecting later election filed with the International Bureau on:

2. The election [X] was [ ] was not

made before the expiration of 19 months from the priority date or, where Rule 32 applies, within the time limit under
Rule 32.2(b).
PATENT COOPERATION TREATY

PCT 



INTERNATIONAL SEARCH REPORT 

(PCT Article 18 and Rules 43 and 44) 



Applicant's or agent's file reference 

3260.93-304 


FOR FURTHER ACTION: see Notification of Transmittal of International Search Report
(Form PCT/ISA/220) as well as, where applicable, item 5 below


International application No 

PCT/US 00/01338 


International filing date (day/month/year)

21/01/2000 


(Earliest) Priority Date (day/month/year)

21/01/1999 


Applicant 

IMMUNEX CORPORATION et al.



This International Search Report has been prepared by this International Searching Authority and is transmitted to the applicant 
according to Article 18. A copy is being transmitted to the International Bureau. 



[ ] It is also accompanied by a copy of each prior art document cited in this report.

1. Basis of the report

a. With regard to the language, the international search was carried out on the basis of the international application in the
language in which it was filed, unless otherwise indicated under this item.

[ ] the international search was carried out on the basis of a translation of the international application furnished to this
Authority (Rule 23.1(b)).

b. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the international search
was carried out on the basis of the sequence listing:



[X] contained in the international application in written form.

[ ] filed together with the international application in computer readable form.

[ ] furnished subsequently to this Authority in written form.

[ ] furnished subsequently to this Authority in computer readable form.

[ ] the statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the
international application as filed has been furnished.

[ ] the statement that the information recorded in computer readable form is identical to the written sequence listing has been
furnished.

Certain claims were found unsearchable (See Box I) 
Unity of invention is lacking (see Box II) 



4. With regard to the title,

[X] the text is approved as submitted by the applicant.

[ ] the text has been established by this Authority to read as follows:

5. With regard to the abstract,

Figure of the drawings to be published with the abstract:
[X] as suggested by the applicant
[ ] because the applicant failed to suggest a figure






Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 
1. [ ] Claims Nos.:

because they relate to subject matter not required to be searched by this Authority, namely:

2. [ ] Claims Nos.:

because they relate to parts of the International Application that do not comply with the prescribed requirements to such
an extent that no meaningful International Search can be carried out, specifically:

3. [ ] Claims Nos.:

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a).

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 



1. [ ] As all required additional search fees were timely paid by the applicant, this International Search Report covers all
searchable claims.

2. [ ] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment
of any additional fee.

3. [ ] As only some of the required additional search fees were timely paid by the applicant, this International Search Report
covers only those claims for which fees were paid, specifically claims Nos.:

4. [X] No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is
restricted to the invention first mentioned in the claims; it is covered by claims Nos.:

Claims 1-14.



FURTHER INFORMATION CONTINUED FROM PCT/ISA/210



1. Claims: 1-14 

SVPH1, its encoding nucleotides and antibodies against SVPH1 

2. Claims: 15-28 

SVPH4, its encoding nucleotides and antibodies against SVPH4 



3. Claims: 29-42 

SVPH3, its encoding nucleotides and antibodies against SVPH3 






A. CLASSIFICATION OF SUBJECT MATTER 



IPC 7 C12N15/57 C12N9/64 C07K16/40 C12N1/21 C12N5/10

According to International Patent Classification (IPC) or to both national classification and IPC



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 7 C12N 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category ° Citation of document, with indication, where appropriate, of the relevant passages



Relevant to claim No. 



EMBL/GENBANK DATABASES Accession no X85598 

Sequence reference HS111AEST 

JONES M ET AL: "Chromosomal assignment of 3XX sequences transcribed in adult human testis"

XP002137513

the whole document

1-14

HOOFT VAN HUIJSDUIJNEN R: "ADAM 20 and 21; two novel human testis-specific
membrane metalloproteases with similarity to fertilin-alpha"
GENE, NL, ELSEVIER, AMSTERDAM,
vol. 206, no. 2,
12 January 1998 (1998-01-12), pages 273-282, XP002088618
ISSN: 0378-1119



Further documents are listed in the continuation of box C



Patent family members are listed in annex 



° Special categories of cited documents 

"A" document defining the general state of the art which is not
considered to be of particular relevance

"E" earlier document but published on or after the international
filing date

"L" document which may throw doubts on priority claim(s) or
which is cited to establish the publication date of another
citation or other special reason (as specified)

"O" document referring to an oral disclosure, use, exhibition or
other means

"P" document published prior to the international filing date but
later than the priority date claimed

"T" later document published after the international filing date
or priority date and not in conflict with the application but
cited to understand the principle or theory underlying the
invention

"X" document of particular relevance; the claimed invention
cannot be considered novel or cannot be considered to
involve an inventive step when the document is taken alone

"Y" document of particular relevance; the claimed invention
cannot be considered to involve an inventive step when the
document is combined with one or more other such documents,
such combination being obvious to a person skilled
in the art



Name and mailing address of the ISA

European Patent Office, P.B. 5818 Patentlaan 2
NL - 2280 HV Rijswijk

Authorized officer






C(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category ° Citation of document, with indication, where appropriate, of the relevant passages



Relevant to claim No 



P,A 



P,X



P,X



WO 99 36549 A (IMMUNEX CORP ;CERRETTI 

DOUGLAS PAT (US)) 

22 July 1999 (1999-07-22) 

CERRETTI DOUGLAS PAT ET AL: "Isolation of
two novel metalloproteinase-disintegrin
(ADAM) cDNAs that show testis-specific
gene expression."

BIOCHEMICAL AND BIOPHYSICAL RESEARCH 
COMMUNICATIONS OCT. 5, 1999, 
vol. 263, no. 3, 

5 October 1999 (1999-10-05), pages 
810-815, XP002137511 

ISSN: 0006-291X 
the whole document 

XU RENER ET AL: "Molecular cloning and 
mapping of a novel ADAM gene (ADAM29) to 
human chromosome 4." 
GENOMICS DEC. 15, 1999, 
vol. 62, no. 3, 

15 December 1999 (1999-12-15), pages 
537-539, XP002137512 

ISSN: 0888-7543 
the whole document 



1-14 



1-14 






INTERNATIONAL SEARCH REPORT 

Information on patent family members

International Application No

PCT/US 00/01338



Patent document           Publication    Patent family    Publication
cited in search report    date           member(s)        date

WO 9936549 A              22-07-1999     AU 2221999 A     02-08-1999



PATENT COOPERATION TREATY



From the INTERNATIONAL SEARCHING AUTHORITY 



To: 

FINNEGAN, HENDERSON, FARABOW,
GARRETT & DUNNER, L.L.P.
Attn. Garrett, Arthur, S.
1300 I Street, N.W. 
Washington, D.C. 20005-3315 
UNITED STATES OF AMERICA 



PCT 



NOTIFICATION OF TRANSMITTAL OF 
THE INTERNATIONAL SEARCH REPORT 
OR THE DECLARATION 

(PCT Rule 44.1) 





Date of mailing (day/month/year)

01/09/2000


Applicant's or agent's file reference 

3260.93-304 


FOR FURTHER ACTION See paragraphs 1 and 4 below 


International application No. 

PCT/US 00/01338 


International filing date (day/month/year)

21/01/2000


Applicant

IMMUNEX CORPORATION et al.



1. [X] The applicant is hereby notified that the International Search Report has been established and is transmitted herewith.

Filing of amendments and statement under Article 19:

The applicant is entitled, if he so wishes, to amend the claims of the International Application (see Rule 46):

When? The time limit for filing such amendments is normally 2 months from the date of transmittal of the
International Search Report; however, for more details, see the notes on the accompanying sheet.

Where? Directly to the International Bureau of WIPO
34, chemin des Colombettes
1211 Geneva 20, Switzerland
Facsimile No.: (41-22) 740.14.35

For more detailed instructions, see the notes on the accompanying sheet.

[Stamp: SEP 18 2000, FINNEGAN, HENDERSON, FARABOW]

2. [ ] The applicant is hereby notified that no International Search Report will be established and that the declaration under
Article 17(2)(a) to that effect is transmitted herewith.



3. [ ] With regard to the protest against payment of (an) additional fee(s) under Rule 40.2, the applicant is notified that:

[ ] the protest together with the decision thereon has been transmitted to the International Bureau together with the
applicant's request to forward the texts of both the protest and the decision thereon to the designated Offices.

[ ] no decision has been made yet on the protest; the applicant will be notified as soon as a decision is made.



4. Further action(s): The applicant is reminded of the following:

Shortly after 18 months from the priority date, the international application will be published by the International Bureau.
If the applicant wishes to avoid or postpone publication, a notice of withdrawal of the international application, or of the
priority claim, must reach the International Bureau as provided in Rules 90bis.1 and 90bis.3, respectively, before the
completion of the technical preparations for international publication.

Within 19 months from the priority date, a demand for international preliminary examination must be filed if the applicant 





Name and mailing address of the International Searching Authority

European Patent Office, P.B. 5818 Patentlaan 2
NL - 2280 HV Rijswijk

Authorized officer



NOTES TO FORM PCT/ISA/220



These Notes are intended to give the basic instructions concerning the filing of amendments under Article 19. The
Notes are based on the requirements of the Patent Cooperation Treaty, the Regulations and the Administrative Instructions
under that Treaty. In case of discrepancy between these Notes and those requirements, the latter are applicable. For more
detailed information, see also the PCT Applicant's Guide, a publication of WIPO.

In these Notes, "Article", "Rule", and "Section" refer to the provisions of the PCT, the PCT Regulations and the PCT
Administrative Instructions, respectively.



INSTRUCTIONS CONCERNING AMENDMENTS UNDER ARTICLE 19 



The applicant has, after having received the international search report, one opportunity to amend the claims of the
international application. It should however be emphasized that, since all parts of the international application (claims,
description and drawings) may be amended during the international preliminary examination procedure, there is usually
no need to file amendments of the claims under Article 19 except where, e.g., the applicant wants the latter to be published
for the purposes of provisional protection or has another reason for amending the claims before international publication.
Furthermore, it should be emphasized that provisional protection is available in some States only.



What parts of the international application may be amended? 

Under Article 19, only the claims may be amended.

During the international phase, the claims may also be amended (or further amended) under Article 34 before
the International Preliminary Examining Authority. The description and drawings may only be amended under
Article 34 before the International Examining Authority.

Upon entry into the national phase, all parts of the international application may be amended under Article 28
or, where applicable, Article 41.



When? Within 2 months from the date of transmittal of the international search report or 16 months from the priority
date, whichever time limit expires later. It should be noted, however, that the amendments will be considered
as having been received on time if they are received by the International Bureau after the expiration of the
applicable time limit but before the completion of the technical preparations for international publication
(Rule 46.1).



Where not to file the amendments? 

The amendments may only be filed with the International Bureau and not with the receiving Office or the
International Searching Authority (Rule 46.2).

Where a demand for international preliminary examination has been/is filed, see below.



How? Either by cancelling one or more entire claims, by adding one or more new claims or by amending the text of
one or more of the claims as filed.

A replacement sheet must be submitted for each sheet of the claims which, on account of an amendment or
amendments, differs from the sheet originally filed.

All the claims appearing on a replacement sheet must be numbered in Arabic numerals. Where a claim is
cancelled, no renumbering of the other claims is required. In all cases where claims are renumbered, they must
be renumbered consecutively (Administrative Instructions, Section 205(b)).

The amendments must be made in the language in which the international application is to be published. 



What documents must/may accompany the amendments?
Letter (Section 205(b)):

The amendments must be submitted with a letter.

The letter must be in English or French, at the choice of the applicant. However, if the language of the
international application is English, the letter must be in English; if the language of the international application
is French, the letter must be in French.



NOTES TO FORM PCT/ISA/220 (continued)



The letter must indicate the differences between the claims as filed and the claims as amended. It must, in
particular, indicate, in connection with each claim appearing in the international application (it being understood
that identical indications concerning several claims may be grouped), whether

(i) the claim is unchanged,

(ii) the claim is cancelled,

(iii) the claim is new,

(iv) the claim replaces one or more claims as filed, 

(v) the claim is the result of the division of a claim as filed. 



The following examples illustrate the manner in which amendments must be explained in the 
accompanying letter: 

1. [Where originally there were 48 claims and after amendment of some claims there are 51]:
"Claims 1 to 29, 31, 32, 34, 35, 37 to 48 replaced by amended claims bearing the same numbers,
claims 30, 33 and 36 unchanged; new claims 49 to 51 added."

2. [Where originally there were 15 claims and after amendment of all claims there are 11]:
"Claims 1 to 15 replaced by amended claims 1 to 11."

3. [Where originally there were 14 claims and the amendments consist in cancelling some claims and in adding
new claims]:

"Claims 1 to 6 and 14 unchanged, claims 7 to 13 cancelled, new claims 15, 16 and 17 added." or
"Claims 7 to 13 cancelled, new claims 15, 16 and 17 added, all other claims unchanged."

4. [Where various kinds of amendments are made]:

"Claims 1-10 unchanged, claims 11 to 13, 18 and 19 cancelled; claims 14, 15 and 16 replaced by amended
claim 14, claim 17 subdivided into amended claims 15, 16 and 17; new claims 20 and 21 added."



"Statement under article 19(1)" (Rule 46.4) 

The amendments may be accompanied by a statement explaining the amendments and indicating any impact 
that such amendments might have on the description and the drawings (which cannot be amended under 
Article 19(1)). 

The statement will be published with the international application and the amended claims.
It must be in the language in which the international application is to be published.

It must be brief, not exceeding 500 words if in English or if translated into English.

It should not be confused with and does not replace the letter indicating the differences between the claims
as filed and as amended. It must be filed on a separate sheet and must be identified as such by a heading,
preferably by using the words "Statement under Article 19(1)."

It may not contain any disparaging comments on the international search report or the relevance of citations 
contained in that report. Reference to citations, relevant to a given claim, contained in the international search 
report may be made only in connection with an amendment of that claim. 



Consequence if a demand for international preliminary examination has already been filed 

If, at the time of filing any amendments and any accompanying statement, under Article 19, a demand for 
international preliminary examination has already been submitted, the applicant must preferably, at the time of 
filing the amendments (and any statement) with the International Bureau, also file with the International
Preliminary Examining Authority a copy of such amendments (and of any statement) and, where required, a
translation of such amendments for the procedure before that Authority (see Rules 55.3(a) and 62.2, first
sentence). For further information, see the Notes to the demand form (PCT/IPEA/401).



Consequence with regard to translation of the international application for entry into the national phase 

The applicant's attention is drawn to the fact that, upon entry into the national phase, a translation of the
claims as amended under Article 19 may have [...]




PATENT COOPERATION TREATY



PCT 



INTERNATIONAL PRELIMINARY EXAMINATION REPORT 



(PCT Article 36 and Rule 70) 



Applicant's or agent's file reference

3260.93-304

FOR FURTHER ACTION: See Notification of Transmittal of International
Preliminary Examination Report (Form PCT/IPEA/416)



International application No 
PCT/US00/01338 



International filing date (day/month/year)
21/01/2000



Priority date (day/month/year)
21/01/1999 



International Patent Classification (IPC) or national classification and IPC 
C12N 15/57 



Applicant 

IMMUNEX CORPORATION et al.

1. This international preliminary examination report has been prepared by this International Preliminary Examining Authority 
and is transmitted to the applicant according to Article 36. 

2. This REPORT consists of a total of 8 sheets, including this cover sheet. 

[X] This report is also accompanied by ANNEXES, i.e. sheets of the description, claims and/or drawings which have
been amended and are the basis for this report and/or sheets containing rectifications made before this Authority
(see Rule 70.16 and Section 607 of the Administrative Instructions under the PCT).

These annexes consist of a total of 4 sheets.



3. This report contains indications relating to the following items: 

I    [X] Basis of the report

II   [ ] Priority

III  [X] Non-establishment of opinion with regard to novelty, inventive step and industrial applicability

IV   [ ] Lack of unity of invention

V    [X] Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability;
citations and explanations supporting such statement

VI   [X] Certain documents cited

VII  [ ] Certain defects in the international application

VIII [X] Certain observations on the international application









I. Basis of the report 

1. With regard to the elements of the international application (Replacement sheets which have been furnished to
the receiving Office in response to an invitation under Article 14 are referred to in this report as "originally filed"
and are not annexed to this report since they do not contain amendments (Rules 70.16 and 70.17)):
Description, pages: 

1-70 as originally filed 

Claims, No.: 

1-44 as received on 14/02/2001 with letter of 14/02/2001 

Drawings, sheets:

1/2, 2/2 as originally filed

Sequence listing part of the description, pages: 

1-26, as originally filed 

2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language: , which is: 

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1 (b)). 

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule
55.2 and/or 55.3).

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the
international preliminary examination was carried out on the basis of the sequence listing:

[X] contained in the international application in written form.

□ filed together with the international application in computer readable form. 

□ furnished subsequently to this Authority in written form. 

□ furnished subsequently to this Authority in computer readable form. 



4. The amendments have resulted in the cancellation of:

□ the description, pages:

□ the claims, Nos.:

□ the drawings, sheets:



5. This report has been established as if (some of) the amendments had not been made, since they have been
considered to go beyond the disclosure as filed (Rule 70.2(c)): 

(Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this 
report.) 

see separate sheet 

6. Additional observations, if necessary: 

III. Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 

1 . The questions whether the claimed invention appears to be novel, to involve an inventive step (to be non- 
obvious), or to be industrially applicable have not been examined in respect of: 

□ the entire international application. 
[X] claims Nos. 15-42, 44.

because:

□ the said international application, or the said claims Nos. relate to the following subject matter which does
not require an international preliminary examination (specify):

□ the description, claims or drawings (indicate particular elements below) or said claims Nos. are so unclear
that no meaningful opinion could be formed (specify):

□ the claims, or said claims Nos. are so inadequately supported by the description that no meaningful opinion
could be formed.

[X] no international search report has been established for the said claims Nos. 15-42, 44.

2. A meaningful international preliminary examination cannot be carried out due to the failure of the nucleotide
and/or amino acid sequence listing to comply with the standard provided for in Annex C of the Administrative
Instructions:



V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability, 
citations and explanations supporting such statement 





1. Statement

Novelty (N)                     Yes: Claims 1(a)(b)(e), 2-14
                                No:  Claims 43

Inventive step (IS)             Yes: Claims
                                No:  Claims 1(a)(b)(e), 2-14, 43

Industrial applicability (IA)   Yes: Claims 1(a)(b)(e), 2-14, 43
                                No:  Claims



2. Citations and explanations 
see separate sheet 

VI. Certain documents cited 

1. Certain published documents (Rule 70.10) 

and / or 

2. Non-written disclosures (Rule 70.9) 
see separate sheet 

VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 
see separate sheet 




INTERNATIONAL PRELIMINARY EXAMINATION REPORT - SEPARATE SHEET

International application No. PCT/US00/01338



Re Item I 

Basis of the report 

1. The amendments filed with the letter dated 14.02.01 introduce subject-matter
which extends beyond the content of the application as filed, contrary to Article 
34(2)(b) PCT. The amendments concerned are in claims 1, 13, 15 and 27. 

Page 23, lines 8-17 discloses that each of the SVPH-1 polypeptides encodes a 
signal sequence, a pro-domain, a catalytic domain, a disintegrin domain, a 
cysteine rich region and a transmembrane domain; but no basis is provided for 
nucleic acid molecules encoding polypeptide fragments comprising less than all
these domains, for instance only one domain. Therefore, said passage of the 
description does not provide a basis for amended claims 1(c) and 13. 

Page 45, lines 10-13 refers to nucleotide fragments comprising at least about 17 
contiguous nucleotides of a DNA sequence according to the application, but does 
not generally disclose nucleic acid molecules comprising at least about 17 
contiguous nucleotides which hybridize to nucleic acid molecules according to the 
application. It is important to note that it is not sufficient that the features of a 
claimed molecule as such are originally disclosed, but their combination has to be 
disclosed as well. Moreover, claim 1(d) refers to claim 1(c) which is in any case
not based on the application as filed. 

Consequently, the examination was limited to claim 1(a), (b), (e), to claim 13 as far
as it relates to a polypeptide comprising an amino acid sequence selected from
the group of SEQ ID NO: 12, 13 and 14, and to claim 43. Claims 3-12 and 14
have only been examined to the extent to which they refer back to these parts of
claims 1 and 13.

Accordingly, claims 15 and 27 also go beyond the disclosure of the application as
originally filed; however, these claims have not been subject to examination.







Re Item V 

Reasoned statement under Article 35(2) with regard to novelty, inventive step or 
industrial applicability; citations and explanations supporting such statement 

2. The following documents are cited: 

D1: EMBL/GENBANK DATABASES Accession no X85598 Sequence reference
HS111AEST JONES M ET AL: 'Chromosomal assignment of 3XX sequences
transcribed in adult human testis'

D2: HOOFT VAN HUIJSDUIJNEN R: GENE, vol. 206, no. 2, 12 January 1998
(1998-01-12), pages 273-282

D3: WO 99 36549 A 

D4: CERRETTI DOUGLAS PAT ET AL: BIOCHEMICAL AND BIOPHYSICAL
RESEARCH COMMUNICATIONS OCT. 5, vol. 263, no. 3, 5 October 1999
(1999-10-05), pages 810-815

D5: XU RENER ET AL: GENOMICS DEC. 15, 1999, vol. 62, no. 3, 15 December 
1999 (1999-12-15), pages 537-539 

3. The current assessment is based on the assumption that all claims enjoy priority 
rights from the filing date of the priority document. If it later turns out that this is 
not correct, the documents D3 to D5 cited in the international search report would 
become relevant. 

4. For the purpose of the present report, the (unclear) claim 43 has been examined
as if it was directed to the subject-matter of claim 1(d) and (f) as originally filed
(which was apparently intended by the Applicant). Claim 43 is broader in scope
than amended claim 1 to which it refers, and it would therefore not appear to be
meaningful to examine it as a dependent claim.

Based on this interpretation, the subject-matter of claim 43 is not new, because it
does not clearly define the claimed nucleic acid molecule. Claim 43(a) [...]
43(b) do also not define clear features which could distinguish the claimed
molecules from the prior art ones. Therefore, claim 43 is considered to lack
novelty, in particular over the document D2 which discloses polypeptides with 
more than 50% amino acid identity to those of the application, but also over any 
other known nucleic acid molecule. 

5. Claim 1(a)(b)(e), claims 2-12 and 14 referring thereto, and claim 13 as far as it
relates to a polypeptide comprising an amino acid sequence selected from the
group of SEQ ID NO: 12, 13 and 14 appear to be novel over the available prior
art, since the claims no longer refer to SEQ ID NO:1 known from D1.

However, these claims are not considered to involve an inventive step for the
following reasons.

The present application provides no evidence of the activity or biological role
of the claimed molecules. Consequently, the invention of the present application is
considered merely to be the provision of a transcribed sequence with no known
technically useful property. In this case, any prior art compound (e.g. those of D2)
is equally suitable as the starting point for making structural modifications and 
may be considered as the "closest prior art". 

Starting from this point, the only technical problem which may be derived is the 
provision of a further compound as such, regardless of its useful properties. 
Without the concomitant need to provide any particular technical effect, the skilled 
person would have had the choice of an infinite number of equally possible 
solutions. An arbitrary selection from this host of possible solutions cannot involve 
an inventive step because, in order to be inventive, the selection must not be 
arbitrary but must be justified by the technical purpose, i.e. by a hitherto unknown 
or unexpected technical effect which is caused by those structural features 
distinguishing the claimed compound from the numerous other ones. 
The Applicant's argument that the technical problem of the application should be 
seen in the disclosure of a metalloproteinase-disintegrin polypeptide could not be 
followed, since no such proteinase activity has been demonstrated for the 

polypeptides according to the application. Only a problem which has been actually




INTERNATIONAL PRELIMINARY EXAMINATION REPORT - SEPARATE SHEET
International application No. PCT/US00/01338

Re Item VI 

Certain documents cited 



6. Certain published documents (Rule 70.10)

Application No     Publication date     Filing date          Priority date (valid claim)
Patent No          (day/month/year)     (day/month/year)     (day/month/year)

WO 99/36549        22.07.99             12.01.99             14.01.98



Re Item VIII 

Certain observations on the international application 

7. The application does not meet the requirements of Article 6 PCT because the
claims do not clearly define the subject-matter claimed. Claim 43(a) encompasses
nearly any known nucleic acid molecule (see also point 4 above). Furthermore,
the terms "SVPH", "SVPH1", "allelic variant" and "species homolog" used in claim
1, and the term "SVPH1 polypeptide" used in claim 43(b) are vague and unclear
and leave the reader in doubt as to the meaning of the technical features to which
they refer, thereby rendering the definition of the subject-matter of said claims
unclear (Article 6 PCT). The term "about" renders claim 1(d) unclear.

What is claimed is:

1. An isolated SVPH nucleic acid molecule selected from the group
consisting of:

(a) the DNA sequence selected from the group consisting of SEQ ID
NO:1, SEQ ID NO:7, SEQ ID NO:8, and SEQ ID NO:9;

(b) an isolated nucleic acid molecule encoding an amino acid sequence
comprising the sequence selected from the group consisting of SEQ ID NO:4, SEQ ID
NO:12, SEQ ID NO:13, and SEQ ID NO:14;

(c) an isolated nucleic acid molecule that hybridizes to either strand of
a denatured, double-stranded DNA comprising the nucleic acid sequence of (a) or (b)
under conditions of moderate stringency in 50% formamide and 6XSSC, at 42°C with
washing conditions of 60°C, 0.5XSSC, 0.1% SDS;

(d) an isolated nucleic acid molecule derived by in vitro mutagenesis
from SEQ ID NO:1, SEQ ID NO:7, SEQ ID NO:8, and SEQ ID NO:9;

(e) an isolated nucleic acid molecule degenerate from SEQ ID NO:1,
SEQ ID NO:7, SEQ ID NO:8, and SEQ ID NO:9 as a result of the genetic code; and

(f) an isolated nucleic acid molecule selected from the group consisting
of human SVPH 1 DNA; an allelic variant of human SVPH 1 DNA; and a species
homolog of SVPH 1 DNA.

2. The nucleic acid molecule of claim 1 selected from the group
consisting of SEQ ID NO:1, SEQ ID NO:7, SEQ ID NO:8, and SEQ ID NO:9.

3. A recombinant vector that directs the expression of the nucleic acid
molecule of claim 1.

4. An isolated polypeptide encoded by the nucleic acid molecule of claim 1.

5. An isolated polypeptide according to claim 4 having a molecular
weight selected from the group consisting of approximately 4. 1 ( '°, Sfi, ( )S V N ( >,4 So :



7. Isolated antibodies that bind to a polypeptide of claim 4.

8. Isolated antibodies according to claim 7, wherein the antibodies are
monoclonal antibodies.

9. A host cell transfected or transduced with the vector of claim 3.

10. A method for the production of SVPH 1 polypeptide comprising
culturing a host cell of claim 9 under conditions promoting expression, and recovering
the polypeptide from the culture medium.

11. The method of claim 10, wherein the host cell is selected from the
group consisting of bacterial cells, yeast cells, plant cells, and animal cells.

12. The method of claim 10, wherein the host cell is a mammalian cell.

13. An isolated polypeptide comprising an amino acid sequence selected
from the group consisting of SEQ ID NO:4, SEQ ID NO:12, SEQ ID NO:13, and
SEQ ID NO:14.

14. An oligomer comprising a polypeptide of claim 4. 

15. An isolated SVPH nucleic acid molecule selected from the group 
consisting of: 

(a) the DNA sequence selected from the group consisting of SEQ ID
NO:3, SEQ ID NO:10, and SEQ ID NO:11;

(b) an isolated nucleic acid molecule encoding an amino acid sequence
comprising the sequence selected from the group consisting of SEQ ID NO:6, SEQ ID
NO:15, and SEQ ID NO:16;

(c) an isolated nucleic acid molecule that hybridizes to either strand of
a denatured, double-stranded DNA comprising the nucleic acid sequence of (a) or (b)
under conditions of moderate stringency in 50% formamide and 6XSSC, at 42°C with
washing conditions of 60°C, 0.5XSSC, 0.1% SDS;

(d) an isolated nucleic acid molecule derived by in vitro mutagenesis
from SEQ ID NO:3, SEQ ID NO:10, and SEQ ID NO:11;

(e) an isolated nucleic acid molecule degenerate from SEQ ID NO:3,
SEQ ID NO:10, and SEQ ID NO:11 as a result of the genetic code; and

(f) an isolated nucleic acid molecule selected from the group consisting
of human SVPH 4 DNA; an allelic variant of human SVPH 4 DNA; and a species
homolog of SVPH 4 DNA.

16. The nucleic acid molecule of claim 15 selected from the group
consisting of SEQ ID NO:3, SEQ ID NO:10, and SEQ ID NO:11.

17. A recombinant vector that directs the expression of the nucleic acid
molecule of claim 15.

18. An isolated polypeptide encoded by the nucleic acid molecule of claim 15.

19. An isolated polypeptide according to claim 18 having a molecular
weight selected from the group consisting of approximately 55,209, 8S,923; and
87,990 Daltons as determined by SDS-PAGE.

20. An isolated polypeptide according to claim 18 in non-glycosylated
form.

21. Isolated antibodies that bind to a polypeptide of claim 18. 

22. Isolated antibodies according to claim 21, wherein the antibodies are 
monoclonal antibodies. 

23. A host cell transfected or transduced with the vector of claim 17.

24. A method for the production of SVPH 4 polypeptide comprising 
culturing a host cell of claim 23 under conditions promoting expression, and 
recovering the polypeptide from the culture medium. 

25. The method of claim 24, wherein the host cell is selected from the 
group consisting of bacterial cells, yeast cells, plant cells, and animal cells. 

26. The method of claim 24, wherein the host cell is a mammalian cell. 

27. An isolated polypeptide comprising an amino acid sequence selected
from the group consisting of SEQ ID NO:6, SEQ ID NO:15, and SEQ ID NO:16.

28. An oligomer comprising a polypeptide of claim 18.

29. An isolated SVPH nucleic acid molecule selected from the group 
consisting of: 

(a) the DNA sequence of SEQ ID NO:2;

(c) an isolated nucleic acid molecule that hybridizes to either strand of
a denatured, double-stranded DNA comprising the nucleic acid sequence of (a) or (b)
under conditions of moderate stringency in 50% formamide and 6XSSC, at 42°C with
washing conditions of 60°C, 0.5XSSC, 0.1% SDS;

(d) an isolated nucleic acid molecule derived by in vitro mutagenesis
from SEQ ID NO:2;

(e) an isolated nucleic acid molecule degenerate from SEQ ID NO:2 as
a result of the genetic code; and

(f) an isolated nucleic acid molecule selected from the group consisting
of human SVPH 3 DNA; an allelic variant of human SVPH 3 DNA; and a species
homolog of SVPH 3 DNA.

30. The nucleic acid molecule of claim 29, wherein the DNA sequence
comprises SEQ ID NO:2.

31. A recombinant vector that directs the expression of the nucleic acid 
molecule of claim 29. 

32. An isolated polypeptide encoded by the nucleic acid molecule of claim 

29. 

33. An isolated polypeptide according to claim 32 having a molecular
weight of approximately 13,938 Daltons as determined by SDS-PAGE.

34. An isolated polypeptide according to claim 32 in non-glycosylated
form.

35. Isolated antibodies that bind to a polypeptide of claim 32. 

36. Isolated antibodies according to claim 35, wherein the antibodies are 
monoclonal antibodies. 

37. A host cell transfected or transduced with the vector of claim 31.

38. A method for the production of SVPH 3 polypeptide comprising
culturing a host cell of claim 37 under conditions promoting expression, and
recovering the polypeptide from the culture medium.

39. The method of claim 38, wherein the host cell is selected from the
group consisting of bacterial cells, yeast cells, plant cells, and animal cells.

METALLOPROTEINASE-DISINTEGRIN FAMILY MEMBERS:
SVPH DNAS AND POLYPEPTIDES

CROSS REFERENCE TO RELATED APPLICATIONS 

This application hereby claims the benefit of United States provisional
applications S.N. 60/116,670; S.N. 60/138,682; and S.N. 60/155,798; filed January
21, 1999; June 14, 1999; and September 27, 1999, respectively. The entire
disclosures of these applications are relied upon and incorporated by reference herein.

BACKGROUND OF THE INVENTION 
Field of the Invention 

The invention is directed to purified and isolated, novel SVPH polypeptides
(SVPH-1, SVPH-1a, SVPH-1b, SVPH-1c; SVPH-3; and SVPH-4, SVPH-4a, and
SVPH-4b) and fragments thereof, the nucleic acids encoding such polypeptides,
processes for production of recombinant forms of such polypeptides, antibodies
generated against these polypeptides, fragmented peptides derived from these
polypeptides, and uses thereof.

Description of Related Art 

Metalloproteinases are a group of proteinases characterized by the presence of
a metal prosthetic group. Despite this basic similarity, the group, which includes
proteinases from snake venom, numerous microbial proteinases, and vertebrate and
bacterial collagenases, comprises proteinases of widely varying
activities. For example, snake venom proteases are metalloproteinases that affect cell-
matrix interactions. Snake venom also includes "disintegrins," a class of low-
molecular weight, Arg-Gly-Asp (RGD)-containing, cysteine-rich peptides which bind
to integrins (a family of molecules involved in cell-to-cell adhesion, cell-to-matrix
adhesion, and inflammatory responses) expressed on cells.

disintegrin-like, cysteine-rich, and epidermal growth factor domains. See Black et al.,
"ADAMs: focus on the protease domain," Curr. Opin. Cell Biol. 10:654-659 (1998);
401 (1996), all of which are herein incorporated by reference. The metalloproteinase-
disintegrins or ADAMs have a unique domain structure composed of a signal
sequence, pro-domain with a Cys switch, catalytic domain with a zinc-binding motif,
disintegrin domain, cysteine-rich domain, a transmembrane domain, and a
cytoplasmic domain (Black et al., "ADAMs: focus on the protease domain," Curr.
Opin. Cell Biol. 10:654-659 (1998); Blobel, C. P., Cell 90:589-592 (1997)). Thus,
ADAMs are type I transmembrane proteins expressed on the cell surface. ADAMs
have been isolated from mammalian species, Caenorhabditis, Xenopus, and
Drosophila. Approximately half of the ADAMs do not contain the zinc-binding motif
HEXXHXXGXXHD (SEQ ID NO:31), which is thought to be required for enzymatic
activity. However, all ADAMs contain the disintegrin domain, which is
approximately 80 amino acids in length with 15 highly conserved Cys residues. In
some members this region has been found to bind integrins (Almeida, E. A. et al., Cell
81:1095-1104 (1995); Zhang, X. P. et al., J. Biol. Chem. 273:7345-7350 (1998); Nath,
D. et al., J. Cell Sci. 112:579-587 (1999)), although the role of this domain for the
majority of the family members is unknown.
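
As an illustration only, the consensus zinc-binding motif HEXXHXXGXXHD described above could be located in a protein sequence along the following lines. This is a minimal Python sketch, not part of the application: the function name `find_zinc_motif` and the short test sequence are hypothetical, and X is assumed to stand for any single residue.

```python
import re

# Consensus motif HEXXHXXGXXHD from the text; X is assumed to mean "any residue",
# which is written as "." in the regular expression. The lookahead lets
# overlapping occurrences be reported as well.
ZINC_MOTIF = re.compile(r"(?=HE..H..G..HD)")


def find_zinc_motif(protein: str) -> list:
    """Return the 0-based start positions of every motif occurrence."""
    return [match.start() for match in ZINC_MOTIF.finditer(protein.upper())]


if __name__ == "__main__":
    # Hypothetical sequence, made up for the example.
    example = "MKHEAAHAAGAAHDLL"
    print(find_zinc_motif(example))
```

A sequence without the motif simply yields an empty list, which corresponds to the ADAMs described above as lacking the consensus site.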

Over two dozen ADAMs have been identified but only a few have had their
biological roles elucidated. Tumor necrosis factor-α converting enzyme
(TACE/ADAM17) was isolated as the proteinase required for the shedding of TNF-α
from the plasma membrane. See Blobel, C. P., Cell 90:589-592 (1997); Moss, M. et
al., Nature 385:733-736 (1997); Black, R. A. et al., Nature 385:729-733 (1997). More
recently TACE/ADAM17 has been found to be required for the ectodomain shedding
of other cell surface proteins including L-selectin, TGF-α, p80 TNFR, p60 TNFR,
type II IL-1R, and β-amyloid precursor protein (Peschon, J. J. et al., Science
282:1281-1284 (1998)). Fertilin-α/ADAM1 and fertilin-β/ADAM2 are required for
sperm-egg fusion (Myles, D. G. et al., Proc. Nat'l Acad. Sci. USA 91:4195-4198
(1994)) while meltrin-α/ADAM12 has a role in muscle cell fusion (Yagami-



:sn ( ]i)9~); Rooke. J. et al.. Science 2~3:1 



1231 (1996))

Some ADAMs are ubiquitously expressed, such as ADAM9, ADAM10,
ADAM15, and ADAM17, and may have pleiotropic effects, as has been found for
ADAM15 and ADAM17. Many of the other ADAMs, however, show tissue-specific
expression: ADAM12 and ADAM19 in muscle (Yagami-Hiromasa, T. et al., Nature
377:652-656 (1995)), ADAM22 in brain, and ADAM23 in brain and heart (Sagane,
K. et al., Biochem. J. 334:93-98 (1998)). The largest group of ADAMs (Bjarnason, J.
B. et al., Methods Enzymol. 248:345-368 (1995); Jia, L. G. et al., Toxicon 34:1269-
1276 (1996); Stocker, W. et al., Protein Sci. 4:823-840 (1995); Black, R. A. et al.,
Curr. Opin. Cell Biol. 10:654-659 (1998); Blobel, C. P., Cell 90:589-592 (1997);
Almeida, E. A. et al., Cell 81:1095-1104 (1995); Zhang, X. P. et al., J. Biol. Chem.
273:7345-7350 (1998); Wolfsberg, T. G. et al., Dev. Biol. 180:389-401 (1996); Zhu,
G. Z. et al., Gene 234:227-237 (1999); Blobel, C. P. et al., Nature 356:248-252
(1992); Walter, M. A. et al., Nat. Genet. 7:22-28 (1994); Gribskov, M. et al., Nucleic
Acids Res. 14:6745-6763 (1986); Bode, W. et al., FEBS Lett. 331:134-140 (1993); and
Cerretti, D. P. et al., Cytokine 11:541-551 (1999)) is predominantly expressed in testis
and is thought to be involved in spermatogenesis and fertilization (Wolfsberg, T. G. et
al., Dev. Biol. 180:389-401 (1996); Hooft van Huijsduijnen, R., Gene 206:273-282
(1998); Zhu, G. Z. et al., Gene 234:227-237 (1999)). Indeed, the first mammalian
ADAMs discovered, ADAM1 and ADAM2, were found to be required for sperm-egg
fusion (Zhu, G. Z. et al., Gene 234:227-237 (1999)).

The ADAMs family of metalloproteinase-disintegrins also shares homology
with the snake venom protease family (SVPH). In some snake venom protease
members, the disintegrin domain prevents platelet aggregation and thus acts as an
anti-coagulant.

Given the significant function of metalloproteinases in membrane and cell-cell
fusion, cellular adhesion, shedding of membrane proteins, and anti-coagulation, there
is a need in the art for additional metalloproteinases of the ADAMs family and/or the



unknown protein is the culmination of an arduous process of experimentation. In
order to identify an unknown protein, the investigator can rely upon a comparison of
the unknown protein to known peptides using a variety of techniques known to those 
skilled in the art. For instance, proteins are routinely analyzed using techniques such 
as electrophoresis, sedimentation, chromatography, sequencing and mass 
spectrometry. 

In particular, comparison of an unknown protein to polypeptides of known
molecular weight allows a determination of the apparent molecular weight of the
unknown protein (Brock, T. D. et al., Biology of Microorganisms 76-77 (1991)).
Protein molecular weight standards are commercially available to assist in the
estimation of molecular weights of unknown proteins (New England Biolabs Inc.
Catalog: 130-131 (1995); J. L. Hartley, U.S. Patent No. 5,449,758). However, the
molecular weight standards may not correspond closely enough in size to the
unknown protein to allow an accurate estimation of apparent molecular weight. The
difficulty in estimation of molecular weight is compounded in the case of proteins that
are subjected to fragmentation by chemical or enzymatic means, modified by post-
translational modification or processing, and/or associated with other proteins in non-
covalent complexes.
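
The comparison against standards of known molecular weight can be sketched numerically. The Python example below assumes the common laboratory convention that log10 of molecular weight varies roughly linearly with relative gel mobility (Rf); that convention, the function names, and the standard values are assumptions made for illustration and are not taken from the application.

```python
import math


def fit_standard_curve(standards):
    """Least-squares fit of log10(MW) = slope * Rf + intercept.

    `standards` is a list of (relative_mobility, molecular_weight) pairs.
    """
    xs = [rf for rf, _ in standards]
    ys = [math.log10(mw) for _, mw in standards]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def estimate_mw(rf, slope, intercept):
    """Apparent molecular weight of a band with relative mobility `rf`."""
    return 10 ** (slope * rf + intercept)


if __name__ == "__main__":
    # Hypothetical marker set: (Rf, molecular weight in Daltons).
    markers = [(0.0, 100000.0), (0.5, 10000.0), (1.0, 1000.0)]
    slope, intercept = fit_standard_curve(markers)
    print(round(estimate_mw(0.25, slope, intercept)))
```

The estimate is only as good as the spacing of the standards around the unknown band, which is the limitation the paragraph above points out.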

In addition, the unique nature of the composition of a protein with regard to its
specific amino acid constituents results in unique positioning of cleavage sites within
the protein. Specific fragmentation of a protein by chemical or enzymatic cleavage
results in a unique "peptide fingerprint" (Cleveland, D. W. et al., J. Biol. Chem.
252:1102-1106 (1977); Brown, M. et al., J. Gen. Virol. 50:309-316 (1980)).
Consequently, cleavage at specific sites results in reproducible fragmentation of a
given protein into peptides of precise molecular weights. Furthermore, these peptides
possess unique charge characteristics that determine the isoelectric pH of the peptide.
These unique characteristics can be exploited using a variety of electrophoretic and
other techniques (Brock, T. D. et al., Biology of Microorganisms 76-77 (Prentice Hall,



Eckerskorn, C. et al., Electrophoresis 9:830-838 (1988)), particularly
the production of fragments from proteins with a "blocked" N-terminus. In addition,
fragmented proteins can be used for immunization, for affinity selection (R. A.
Brown, U.S. Patent No. 5,151,412), for determination of modification sites (e.g.
phosphorylation), for generation of active biological compounds (Brock, T. D. et al.,
Biology of Microorganisms 300-301 (Prentice Hall, 6th ed. 1991)), and for
differentiation of homologous proteins (Brown, M. et al., J. Gen. Virol. 50:309-316
(1980)).
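
To make the idea of a reproducible peptide fingerprint concrete, the following Python sketch performs an in silico digest and computes peptide masses. It assumes trypsin as the cleavage agent with the commonly used rule (cut after K or R unless the next residue is P) and uses standard monoisotopic residue masses; the choice of enzyme, the function names, and the test sequence are assumptions for illustration, not details from the application.

```python
# Standard monoisotopic residue masses in Daltons (no modifications).
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056  # added once per peptide for the free N- and C-termini


def tryptic_digest(protein):
    """Split after K or R, except when the following residue is P."""
    peptides, start = [], 0
    for i, residue in enumerate(protein):
        next_residue = protein[i + 1] if i + 1 < len(protein) else ""
        if residue in "KR" and next_residue != "P":
            peptides.append(protein[start:i + 1])
            start = i + 1
    if start < len(protein):
        peptides.append(protein[start:])  # C-terminal peptide without K/R
    return peptides


def peptide_mass(peptide):
    """Monoisotopic mass of an unmodified peptide."""
    return sum(RESIDUE_MASS[residue] for residue in peptide) + WATER


if __name__ == "__main__":
    # Hypothetical sequence, made up for the example.
    for pep in tryptic_digest("MAKRPGKLLR"):
        print(pep, round(peptide_mass(pep), 4))
```

Because the cleavage rule is deterministic, the same protein always yields the same set of peptides and masses, which is what makes the fingerprint usable for identification.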

In addition, when a peptide fingerprint of an unknown protein is obtained, it
can be compared to a database of known proteins to assist in the identification of the
unknown protein using mass spectrometry (Henzel, W. J. et al., Proc. Natl. Acad. Sci.
USA 90:5011-5015 (1993); Fenyo, D. et al., Electrophoresis 19:998-1005 (1998)). A
variety of computer software programs to facilitate these comparisons are accessible
via the Internet, such as Protein Prospector (Internet site: prospector.uscf.edu),
MultiIdent (Internet site: www.expasy.ch/sprot/multiident.html), PeptideSearch
(Internet site: www.mann.embl-heiedelberg.de...deSearch/FR_PeptideSearchForm.html),
and ProFound (Internet site: www.chait-sgi.rockefeller.edu/cgi-bin/prot-
id-frag.html). These programs allow the user to specify the cleavage agent and the
molecular weights of the fragmented peptides within a designated tolerance. The
programs compare these molecular weights to protein molecular weight information
stored in databases to assist in determining the identity of the unknown protein.
Accurate information concerning the number of fragmented peptides and the precise
molecular weight of those peptides is required for accurate identification. Therefore,
increasing the accuracy in determining the number of fragmented peptides and their
molecular weight should result in enhanced likelihood of success in the identification
of unknown proteins.
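
The tolerance-based comparison described above can be sketched as follows. This Python example is a simplified illustration and does not reproduce the scoring used by any of the named programs; the function names, the candidate entries `protA` and `protB`, and all mass values are hypothetical.

```python
def count_matches(observed, theoretical, tolerance):
    """Number of observed masses lying within `tolerance` of any theoretical mass."""
    return sum(
        any(abs(obs - theo) <= tolerance for theo in theoretical)
        for obs in observed
    )


def rank_candidates(observed, database, tolerance=0.5):
    """Rank database entries by matched-peptide count, best first.

    `database` maps a protein name to its list of theoretical peptide masses.
    Ties are broken alphabetically so the ordering is deterministic.
    """
    scores = {
        name: count_matches(observed, masses, tolerance)
        for name, masses in database.items()
    }
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    # Hypothetical measured masses and candidate entries (Daltons).
    measured = [500.1, 750.0, 1200.9]
    candidates = {
        "protA": [500.0, 750.2, 1200.5],
        "protB": [500.3, 900.0],
    }
    print(rank_candidates(measured, candidates, tolerance=0.5))
```

Tightening the tolerance reduces chance matches but demands more accurate mass measurements, which is the trade-off the paragraph above describes.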

In addition, peptide digests of unknown proteins can be sequenced using
tandem mass spectrometry (MS/MS) and the resulting sequence searched against
databases < F m 1 T K eta! ' /"■ V- ■ \f*-- v,, - n<>>> 

the Internet, such as Lutefisk 97 (Internet site: www.lsbc.com:70/Lutefisk97.html)





WO 00/43525 



PCT/US00/01338 






and the Protein Prospector, PeptideSearch and ProFound programs described above.
Therefore, adding the sequence of a gene and its predicted protein sequence and 
peptide fragments to a sequence database can aid in the identification of unknown 
proteins using tandem mass spectrometry. 

Thus, there also exists a need in the art for polypeptides suitable for use in 
peptide fragmentation studies, for use in molecular weight measurements, and for use 
in protein sequencing using tandem mass spectrometry. 



The invention aids in fulfilling these various needs in the art by providing
isolated, novel SVPH nucleic acids and polypeptides encoded by these nucleic acids.
Particular embodiments of the invention are directed to an isolated SVPH nucleic acid
molecule comprising the DNA sequence of SEQ ID NOs:1-3 and isolated SVPH
nucleic acid molecules encoding the amino acid sequence of SEQ ID NOs:4-6, as well
as nucleic acid molecules complementary to these sequences. Further studies have
revealed the full-length nucleotide sequences of three alternatively spliced SVPH-1
clones (SEQ ID NOs:7-9) and two alternatively spliced SVPH-4 clones (SEQ ID
NOs:10-11). Thus, further embodiments of the invention are directed to an isolated
SVPH nucleic acid molecule comprising the DNA sequence of SEQ ID NOs:7-11 and
isolated SVPH nucleic acid molecules encoding the amino acid sequence of SEQ ID
NOs:12-16, as well as nucleic acid molecules complementary to these sequences.
Both single-stranded and double-stranded RNA and DNA nucleic acid molecules are
encompassed by the invention, as well as nucleic acid molecules that hybridize to a
denatured, double-stranded DNA comprising all or a portion of SEQ ID NOs:1-3 and
7-11. Also encompassed are isolated nucleic acid molecules that are derived by in
vitro mutagenesis of nucleic acid molecules comprising sequences of SEQ ID NOs:1-
3 and 7-11, that are degenerate from nucleic acid molecules comprising sequences of

niui^K ^.u ii^.cai:, - ui ia «M v. u i> Man, \ oi UaiiMent ;\ trans 1 on ned or trans tec ted 
\\ ith these vectors. 



SUMMARY OF THE INVENTION
In addition, the invention encompasses methods of using the nucleic acids
noted above to identify nucleic acids encoding proteins having metalloproteinase-
disintegrin activities; to identify human chromosome number 1 or 4; to map genes on
human chromosome number 1 or 4; to identify genes associated with certain diseases,
syndromes, or other human conditions associated with human chromosome number 1
or 4; and to study proteinases and their activities on cell/cell interactions as well as
proteinase activity on the immune system.

The invention also encompasses the use of sense or antisense oligonucleotides
from the nucleic acid of SEQ ID NOs:1-3 and 7-11 to inhibit the expression of the
polynucleotides encoded by the SVPH-1, SVPH-3, or SVPH-4 genes.
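
As a small illustration of what an antisense oligonucleotide is at the sequence level, the Python sketch below derives the reverse complement of a sense-strand fragment. The function name `antisense` is hypothetical, and the choice of the first ten nucleotides of SEQ ID NO:1 as input is made only for the example; it is not a recommended target region.

```python
# Watson-Crick pairing for DNA; N (unknown base) is left as N.
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def antisense(sense: str) -> str:
    """Return the reverse complement of a DNA sense strand, read 5' to 3'."""
    return sense.upper().translate(COMPLEMENT)[::-1]


if __name__ == "__main__":
    # First ten nucleotides of SEQ ID NO:1 as listed in this document.
    print(antisense("ATTTTTGATA"))
```

Applying the function twice returns the original sequence, which is a quick sanity check on any oligonucleotide designed this way.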

The invention also encompasses isolated polypeptides and fragments thereof
encoded by these nucleic acid molecules including soluble polypeptide portions of
SEQ ID NOs:4-6 and 12-16. The invention further encompasses methods for the
production of these polypeptides, including culturing a host cell under conditions
promoting expression and recovering the polypeptide from the culture medium.
Especially, the expression of these polypeptides in bacteria, yeast, plant, insect, and
animal cells is encompassed by the invention.

In general, the polypeptides of the invention can be used to study the cell/cell
and cell/matrix interactions involved in cellular processes (including cell fusion as in
sperm/egg interactions, cell recognition and binding) as well as those involved in the
immune system. In addition, these polypeptides can be used to identify other proteins
associated with SVPH family members, ADAMs family members, and other
metalloproteinases.

In addition, the invention includes assays utilizing these polypeptides to screen 
for potential inhibitors of activity associated with polypeptide counter-structure 
molecules, and methods of using these polypeptides as therapeutic agents for the 
treatment of diseases mediated by SVPH polypeptide counter-structure molecules. 

. . _ a h <; . i i. i i ims*.] [ ■ ■ ■ • > v . , - i . il u i^u jo; ;lmi ig in esc poi \ peptides as 

molecular weight markers that allow the estimation of the molecular weight of a
protein or a fragmented protein, as well as a method for the visualization of the
molecular weight markers of the invention using electrophoresis. The
invention further encompasses methods for using the polypeptides of the invention as
markers for determining the isoelectric point of an unknown protein, as well as
controls for establishing the extent of fragmentation of a protein. Further
encompassed by this invention are kits to aid in these determinations.

Isolated polyclonal or monoclonal antibodies that bind to these polypeptides
are also encompassed by the invention, in addition to the use of these antibodies to aid in
purifying the SVPH polypeptide.

Further encompassed by this invention is the use of the SVPH nucleic acid 
sequences, predicted amino acid sequences of the polypeptide or fragments thereof, or 
a combination of the predicted amino acid sequences of the polypeptide and fragments 
thereof for use in searching an electronic database to aid in the identification of 
sample nucleic acids and/or proteins. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 depicts a Northern blot hybridization showing the tissue specificity of 
SVPH-1 and SVPH-4. 

Figure 2 depicts a phylogenetic tree of metalloproteinase-disintegrins. 
Branches marked with heavy lines indicate ADAM family members with a consensus 
zinc-binding motif (HEXXHXXGXXHD) (SEQ ID NO:31). The arrow indicates the 
probable zinc-binding motif containing common ancestor. Lineages in which the 
zinc-binding site was subsequently lost are denoted with an 'X'. Species 
abbreviations: Mm, Mus musculus; Rn, Rattus norvegicus; Hs, Homo sapiens; Mf,
Macaca fascicularis; Oc, Oryctolagus cuniculus; Cc, Cavia cobaya; Cp, Cavia
porcellus; So, Saguinus oedipus; Pp, Pongo pygmaeus; Bt, Bos taurus.

Name: SVPH-1

   1 ATTTTTGATA CCACAGTGAC CAACACGGTC ACCTAAGGTG TTCAATTCTT
  51 TGTAGCAAGT CTCACTTGCA GTATTTGCGC CTGCACCAAA AATCCTCCTA
 101 CACTGTTCAN TTGCGGTCAT GACANGCTC (SEQ ID NO:1)



Name: SVPH-3 

   1 TTTTTGAGTA AGAATAGGTC ATGTTTTAGT AAAACTTCCA AAAGAACAAA
  51 ACAGATTCTT CAACGCAGGA GGACATGTGA GTCACAATAC CGTTTAATCC
 101 ACAGGTTGGC TCCTTGGTTT CTGGAACTTT CTGCGTGGTG TAAACGATGT
 151 GCGGGTGGTA CCCTGCCTCA ACCAGTGGAT GCTTCTTCAC GGGTTCAATG
 201 AAAAAGTGTC CATGTGGTAG TTGGAAAAAT CCAGTCAGTC GATGGCAGGC
 251 ACTGAGGGCT GCCGTCCCAA CTCTGGTGCC CTGCTGTAGA ACGGTGCCAG
 301 TGAGATGGCA GAGGGGGGCA GAGGAAGCCA TCATCTTAAC ATGGGAGAGG
 351 TTGCCATATC TCTTCTCCAT GATGTAGCTA TTGGAAAGAA ATCCTTCATT
 401 GACCGTCAAG TTAAAAAACA GGTCCTTGTC CTCGTGAGAA ATTCTGTAGT
 451 ACACCCAGTC CTCTGAGCC (SEQ ID NO:2)



Name: SVPH-4 



   1 CACGAGGATT TATATCTTCA AAGAAAATAT AATGATGCTG TTGCATGGTC
  51 GTTTGGAAAA GTGTGTTGTC TAGAATATGG TGGATCAGTG AGTACTTTAC
 101 TAGATACAAA TATCCTTGCC CCTGGTACGT GGTCTGCTCA TGAGCTGGGT
 151 CATGCTGTAG GAATGTCAOA TGATGAACAA TAGTGCCAAT GTAGGGGTAG
 201 GCCTAATTGC ATCATGGGCT CAGGACGGAC TGGGTTTAGG AATTGCAGTT
 251 ATATCTCTTT TTTTAAACAT ATCTGTTGGG GAGCAACATG TCTAAATAAT
 301 ATCCCAGGAC TAGGTTATGT GCTTAAGAGA TGTGGAAACA AAATTGTGGA
 351 GGAGAATGAG GAATGTGATT GTGGTTCCAC AGAGGAGTGT CAGAAAGATC
 401 GGTGTTGCCA ATCAAATTGT AAGTTGCAAC CAGGTGGCAA CTGTAGCATT
 451 GGACTTTGCT GTCATGATTG TCGGTTTCGT GCATGTGGAT ACGTGTGTAG
 501 GCAGGAAGGA AATGAATGTG ACCTTGCAGA GTACTGCGAC GGGAATTCAA
 551 GTTCCTGCCC AAATGACGTT TATAAGCAGG ATGGAAGCCC TTGCAAGTAT
 601 GAAGGCCGTT GTTTCAGGAA GGGGTGCAGA TCCAGATATA TGCAGTGGCA
 651 AAGCATTTTT GGACCTGATG GGATGGAGGG TCCTAGTGAG TGCTATGATG
 701 CAGTTAACTT AATAGGTGAT CAATTTGGTA A2TGTGAGAT TACAGGAATT
 751 CGAAATTTTA AAAAGTGTGA AAGTGCAAAT TCAATATGTG GCAGGCTACA
 801 GTGTATAAAT GTTGAAACCA TCCCTGATTT GOCAGAGCAT ACGACTATAA
 851 TTTGTACTCA TTTACAGGCA GAAAATCTCA TGTGCTGGGG CACAGGCTAT
 901 CATCTATCCA TGAAACCCAT GGGAATACCT GACCTAGGTA TGATAAATGA
 951 TGGCACCTCC TGTGGAGAAG GCCGGGTATG TTTTAAAAAA AATTGCGTGA
 1001 ATAGCTCAGT CGTGCAGTTT GACTGTTTGC CTGAGAAATG CAATACCGGG
 1051 GGTGTTTGGA ACAACAGAAA AAAGTGCCAC TGCATGTATG GGTGGGCACC
 1101 TCCATTCTGT GAGGAAGTGG GGTATGGAGG AAGCATTGAC AGTGGGCCTC
 1151 CAGGACTGCT CAGAGGGGCG ATTCCCTC 3T 2AATTTGGGT TGTGTCCATC
 1201 ATAATGTTTC GG 7TTATTTT ATTAATCCTT "T O T O -5 r T"T"T"

Name: SVPH-1a



1 


ATGAAGATGT 


TACTCCTGCT 


GGATTGGCTT 


GGGGTGTTTG 


TGTCCTGTTC 


51 


TGGACACATC 


GAGGATGAGC 


ACGCCGAATA 


TCACAGCGCT 


CCGGATGTGG 


101 


TGATTCCTGT 


GAGGATAACT 


GGCACCACCA 


G AGvj CATG AC 


AGCTCGAGGC 


151 


TGGCTCTCCT 


ATATGGTGCC 


GTTTGGAGGC 


CAGAAACACA 


TTATCCACAT 


201 


AAAGGTCAAG 


AAGCTTTTGT 


TTTGCAAACA 


GGTGCCTGTG 


TTGACCTAGA 


251 


CAGACCAGGG 


TGCTATCCTT 


GAGGACCAGC 


CATTTGTCCA 


GAATAAGTGG 


301 


TACTATCATG 


GTTATGTGGA 


AGGGGACCCA 


GAATCCCTGG 


TTTCCCTCAG 


351 


TACCTGTTTT 


GGGGGTTTTC 


AAGGAATATT 


ACAGATAAAT 


GACTTTGCTT 


4 01 


ATGAAATCAA 


GCCCCTAGCA 


TTTTGTAGCA 


CGTTTGAACA 


TCTGGTATAC 


451 


AAGATGGACA 


GTGAGGAGAA 


AGAATTTTGA 


ACCATGAGAT 


CCGGATTTAT 


501 


GCAAAATGAA 


ATAACATGCC 


GAATGGAATT 


TGAAGAAATT 


GATAATTCCA 


551 


CTCAGAAGCA 


AAGTTCTTAT 


GTGGGCTGGT 


GGATCCATTT 


TAGGATTGTT 


601 


GAAATTGTAG 


TCGTGATTGA 


TAATTATCTG 


TACATTCGTT 


ATGAAAGGAA 


651 


CGACTCAAAG 


TTGGTGGAGG 


ATCTATATGT 


TATTGTTAAT 


ATAGTGGATT 


7 0 1 


CCATTTTGGA 


TGTCATTGGT 


GTTAAGGTGT 


TATTATTTGG 


TTTGGAGATC 


TCI 


r^, ^ C A C G AA T A 


a AAACCTCAT 


1X1 G TAG TAG AT 




AATGTGTGGA 


801 


CCTGTATTGC 


AAGTGGAAGT 


GGGAGAAGAT 


TACGCGCCGG 


ATGCAACATG 


851 


ACACCTCAGA 


TCTTTTCACA 


ACTGTAGGAT 


TAAGAGGGTT 


AAG T G G CAT A 


901 


GGAGCTTTTA 


GAGGAATGTG 


TAGAGGAGAC 


CGTAGTTGTG 


CAATTGTTAC 


951 


TTTCATGAAC 


AAAAGTTTGG 


GCAGTTTTTC 


AATTGCAGTG 


GCTCATCATC 


1001 


TAG G T C AT AA 


TTTGGGCATG 


AAGCATGATG 


AGGATACATG 


TCGTTGTTCA 


1051 


CAACCTAGAT 


GCATAATGCA 


TGAAGGCAAC 


CCACCAATAA 


CTAAATTTAG 


1101 


CAATTGTAGT 


TATGGTGATT 


TTTGGGAATA 


TACTGTAGAG 


AGGACAAAGT 


1151 


GTTTGGTTGA 


AACAGTAGAC 


AGAAAGGACA 


TCTTTAATGT 


GAAGCGCTGT 


1201 


GGGAATGGTG 


TTGTTGAAGA 


AGGAGAAGAG 


TGTGACTGTG 


GACCTTTAAA 


1251 


GCATTGTGCA 


AAAGATGCCT 


GCTGTCTGTG 


AAATTGCACT 


CTGACTGATG 


13 01 


GTTCTACTTG 


TGGTTTTGGG 


GTTTGTTG C A 


AAGACTGCAA 


GTTGCTACCA 


1351 


TCAGGGAAAG 


TGTGTAGAAA 


GGAGGTGAAT 


GAATGTGATC 


TTGCAGAGTG 


14 01 


GTGCAATGGT 


ACTTGCCATA 


AGTGCCCAGA 


TGACTTTTAT 


GTGGAAGATG 


1451 


GAATTCCCTG 


TAAGGAGAGG 


GGCTACTGCT 


ATGAAAAGAG 


CTGTCATGAC 


1501 


CGC AATGAAC 


AGTGTAGGAG 


GATTTTTGGT 


GCAGGCGCAA 


ATACTGCAAG 


1551 


TGAGACTTGC 


TACAAAGAAT 


TGAACACCTT 


AGGTGACGGT 


GTTGGTCACT 


1601 


GTGGTATGAA 


AAATGGTACA 


TATATAAAGT 


GTAATATGTC 


AGATGTGCAG 


1651 


TGTGGAAGAA 


TTGAGTGTGA 


GAATGTGACA 


GAAATTCCGA 


ATATGAGTGA 


1701 


TCATACTACT 


GTGGATTGGG 


CTCGCTTCAA 


TGACATAATG 


TGCTGGAGTA 


175 : 


CTGATTACGA 


TTTGGGGATG 


AAGGGACCTG 


ATATTGGTGA 


AGTGAAAGAT 


18 0 1 


GGAACAGAGT 


GTGGGATAGA 


TCATATATGC 


ATCCACAGGC 


ACTGTGTCCA 


1851


TATAACCATC 


TTGAATAGTA 


ATTGCTCACC 


TGCATTTTGT 


AACAAGAGGG 


1901


GCATCTGCAA 


C AAT AAA GAT 


C -\ C T 3 C AT T 


G C AAT TAT G T 


GTGGGACCCT 


1951 


CCCAACTGCC 


TGATAAAAGG 


(2TATGGAGGT 


AGTGTTGACA 


GTGGCCCACC 


2001


CCCTAAGAGA 


AAGAAGAAAA 


AGAAGTTGTG 


TTATCTGTGT 


ATATTGTTGC 


2051 


TTATTGTTTT 


GTTTATTTTA 


TTATGTTGTG 


TTTATCGACT 


TTGTAAAAAA 


2101 


AGTAAAGCAA 


TAAAAAAGCA 


GCAAGATGTT 


CAAACTCCAT 


CTGCAAAAGA 


2151


AGAGGAAAAA 


ATTCAGGGTC 


GACCTCATGA 


GTTACCTCCC 


CAGAGTCAAC 


2201


CTTGGGTGAT 


GCCTTCGCAG 


AGTCAACCTC 


CTGTGACACG 


CTGGCAGAGG 


2251


GAACCTCAGT 


TGATGCCTTC 




CCTCCTGTGA 


CGCCCTCCTA 


-j r\ ^ 


/ qpn t T) 


f;n . ' 
101


TGATTCCTGT 


GAGGATAACT 


GGGACCACCA 


GAGGCATGAC 


ACCTCCAGGC 


151 


TGGCTCTCCT 


ATATCCTGCC 


CTTTGGAGGC 


CAGAAACACA 


TTATGCACAT 


201


AAAGGTCAAG 


AAGGTTTTGT 


TTTCCAAACA 


CCTCCGTGTG 


TTCACCTACA 


251 


GAGACCAGGG 


TGGTATCGTT 


GAGGACGAGC 


CATTTGTCCA 


GAATAACTGC 


301 


TACTATCATG 


GTTATGTGGA 


AGGGGACCCA 


GAATCCGTGG 


TTTCCCTCAG 


351 


TACCTGTTTT 


GGGGGTTTTC 


AAGGAATATT 


ACAGATAAAT 


GACTTTGCTT 


401 


ATGAAATCAA 


GCCCCTAGCA 


TTTTCTACCA 


GGTTTGAACA. 


TGTGGTATAG 


451 


AAGATGGACA 


GTGAGGAGAA 


ACAATTTTCA 


AGCATGAGAT 


CGGGATTTAT 


501 


GCAAAATGAA 


ATAACATG2C 


GAATGGAATT 


TGAAGAAATT 


GATAATTCGA 


551 


CTCAGAAGCA 


AAGTTGTTAT 


GTGGGCTGGT 


GGATCCATTT 


TAGGATTGTT 


601 


GAAATTGTAG 


TCGTCATTGA 


TAATTATCTG 


TACATTGGTT 


ATGAAAGGAA 


651 


CGACTCAAAG 


TTGCTGGAGG 


ATCTATATGT 


TATTGTTAAT 


ATAGTGGATT 


701


CCATTTTGGA 


TGTCATTGGT 


GTTAAGGTGT 


TATTATTTGG 


TTTGGAGATC 


751 


TGGACCAATA 


AAAACCTGAT 


TGTAGTAGAT 


GATGTAAGGA 


AATCTGTGCA 


801 


CCTGTATTGC 


AAGTGGAAGT 


CGGAGAAGAT 


TACGCCCCGG 


ATGCAACATG 


851 


ACACCTCAGA 


TCTTTTCACA 


ACTCTAGGAT 


TAAGAGGGTT 


AAGTGGCATA 


901 


GGAGCTTTTA 


GAGGAATGTG 


TACACCAGAC 


CGTAGTTGTG 


CAATTGTTAG 


951


TTTGATGAAG 


AAAACTTTCIG 


GGAPTTTTTP 


AATTGCAGTG 


GCTCA.TCATC 


1001 


TAGGTCATAA 


TTTGGGCATG 


AACCATGATG 


AGGATACATG 


TCGTTGTTCA 


1051 


GAACCTAGAT 


GCATAATGCA 


TGAAGGCAAC 


CCACCAATAA 


CTAAATTTAG 


1101 


CAATTGTAGT 


TATGGTGATT 


TTTGGGAATA 


TACTGTAGAG 


AGGACAAAGT 


1151 


GTTTGCTTGA 


AACAGTACAC 


ACAAAGGACA 


TCTTTAATGT 


GAAGCGCTGT 


1201 


GGGAATGGTG 


TTGTTGAAGA 


AGGAGAAGAG 


TGTGACTGTG 


GACCTTTAAA 


1251 


GCATTGTGGA 


AAAGATCCCT 


GCTGTGTGTC 


AAATTGCACT 


CTGACTGATG 


1301 


GTTCTACTTG 


TGCTTTTGGG 


CTTTGTTGCA 


AAGACTGCAA 


GTTCGTACCA 


1351 


TCAGGGAAAG 


TGTGTAGAAA 


GGAGGTCAAT 


GAATGTGATG 


TTCCAGAGTG 


1401 


GTGCAATGGT 


ACTTCCCATA 


AGTGCCCAGA 


TGACTTTTAT 


GTGGAAGATG 


1451 


GAATTCCCTG 


TAAGGAGAGG 


GGCTACTGGT 


ATGAAAAGAG 


CTGTCATGAG 


1501 


CGCAATGAAC 


AGTGTAGGAG 


GATTTTTGGT 


GCAGGCGCAA 


ATACTGCAAG 


1551 


TGAGACTTGC 


TACAAAGAAT 


TGAACACGTT 


AGGTGACCGT 


GTTGGTCAGT 


1601 


GTGGTATCAA 


AAATGCTACA 


TATATAAAGT 


GTAATATGTC 


AGATGTCCAG 


1651 


TGTGGAAGAA 


TTCAGTGTGA 


GAATGTGACA 


GAAATTCCCA 


ATATGAGTGA 


1701 


TCATACTACT 


GTGCATTGGG 


CTCGCTTCAA 


TGACATAATG 


TGCTGGAGTA 


1751 


CTGATTACCA 


TTTGGGGATG 


AAGGGACCTG 


ATATTGGTGA 


AG TG AAAG AT 


1801 


GGAACAGAGT 


GTGGGATAGA 


TCATATATGC 


ATCCACAGGG 


ACTGTGTCCA 


1851 


TATAAGCATC 


TTGAATAGTA 


ATTGCTCACC 


TGCATTTTGT 


AACAAGAGGG 


1901 


GGATGTGCAA 


CAATAAACAT 


CACTGCCATT 


GCAATTATCT 


GTGGGACGCT 


1951 


CCCAACTGCC 


TGATAAAAGG 


CTATGGAGGT 


AGTGTTGAGA 


GTGGTCCACC 


2001


CCGTAAGAGA 


AAGAAGAAAA 


AGAAGTTCTG 


TTATCTGTGT 


ATATTGTTGC 


2051


TTATTGTTTT 


GTTTATTTTA 


TTATGTTGTC 


TTTATCGACT 


TTGTAAAAAA 


2101 


AGT AAAGGAA 


TAAAAAAG Ca 


G 2AAGATGTT 


CAAACTCCAT 


CTGCAAAAGA 


2151


AGAGGAAAAA 


ATTCAGCGTC 


GACCTCATGA 


GTTACCTCCC 


CAGAGTCAAC 


2201


GTTGGGTGAT 


CCCTTCCCAG 


A3TCAACCTC 


CTGTGACGCC 


TTCCCAGAGT 


2251


CATGGTCAGG 


TGATGCCTTC 


CCAGAGTGAA 


CGTCGTCAAA 


ATTTATTCGT 


2301


GTTCAGCTTC 


TCAATGAGTG 


ACTGTGTGCT 


AAATTTTAGG 


CTACTGTATC 


2351 


TTCAGGCCAC 


CTGA (SEQ 


ID NO : 8 ) 
251 CAGACCAGGG TGCTATCCTT GAGGACCAGC CATTTGTCCA GAATAACTGC

3 01 TACTATCATG GTTATGTGGA AGGGGAGGCA GAATGCGTGG TTTCCCTCAG 

35 1 TACGTGTTTT GGGGGTTTTC AAGGAATATT ACAGATAAAT GACTTTGCTT 

401 ATGAAATCAA GCCCCTAGCA TTTTCTACCA CGTTTGAACA TCTGGTATAC 

45 1 AAGATGGACA GTGAGGAGAA ACAATTTTCA ACCATGAGAT CCGGATTTAT 

50 1 GCAAAATGAA ATAACATGCC GAATGGAATT TGAAGAAATT GATAATTCCA 

55 1 CTCAGAAGCA AAGTTCTTAT GTGGGGTGGT GGATCCATTT TAGGATTGTT 

60 1 GAAATTGTAG TCGTCATTGA TAATTATCTG TACATTCGTT ATGAAAGGAA 

65 1 CGACTCAAAG TTGCTGGAGG ATCTATATGT TATTGTTAAT ATAGTGGATT 

70 1 CCATTTTGGA TGTCATTGGT GTTAAGGTGT TATTATTTGG TTTGGAGATC 

751 TGGACCAATA AAAACCTCAT TGTAGTAGAT GATGTAAGGA AATCTGTGCA 

801 CCTGTATTGC AAGTGGAAGT CGGAGAACAT TACGCCGCGG ATGCAACATG 

851 ACACCTCACA TCTTTTCACA ACTCTAGGAT TAAGAGGGTT AAGTGG CAT A 

90 1 GGAGCTTTTA GAGGAATGTG TACACCACAC CGTAGTTGTG CAATTGTTAC 

95 1 TTTCATGAAC AAAACTTTGG GCACTTTTTC AATTGCAGTG GCTCATCATC 

100 1 TAGGTCATAA TTTGGGCATG AACCATGATG AGGATACATG TCGTTGTTCA 

1051 CAACCTAGAT GCATAATGCA TGAAGGCAAC CCACCAATAA CTAAATTTAG 

110 1 CAATTGTAGT TATGGTGATT TTTGGGAATA TACTGTAGAG AQGACAAAGT 

115 1 GTTTGCTTGA AACAGTACAC ACAAAGGACA TCTTTAATGT GAAGCGCTGT 

12 01 GGGAATGGTG TTGTTGAAGA AGGAGAAGAG TGTGACTGTG GACCTTTAAA 

12 51 GCATTGTGCA AAAGATCCCT GCTGTCTGTC AAATTGCACT CTGACTGATG 

13 0 1 GTTCTACTTG TGCTTTTGGG CTTTGTTGCA AAGACTGCAA GTTCCTACCA 

13 51 TCAGGGAAAG TGTGTAGAAA GGAGGTCAAT G AATGTG AT C TTCCAGAGTG 

14 01 GTGCAATGGT ACTTCCCATA AGTGCCCAGA TGACTTTTAT GTGGAAGATG 

14 51 GAATTCCCTG TAAGGAGAGG GGCTACTGCT ATGAAAAGAG CTGTCATGAC 

15 0 1 CGCAATGAAC AGTGTAGGAG GATTTTTGGT GCAGGCGCAA ATACTGCAAG 

15 51 TGAGACTTGC TACAAAGAAT TGAACACCTT AGGTGACCGT GTTGGTCACT 

16 01 GTGGTATCAA AAATGCTACA TATATAAAGT GTAATATCTC AGATGTCCAG 
1651 TGTGGAAGAA TTCAGTGTGA GAATGTGACA GAAATTCCCA ATATGAGTGA 

17 01 TCATACTACT GTGCATTGGG CTCGCTTCAA TGACATAATG TGCTGGAGTA 

17 51 CTGATTACCA TTTGGGGATG AAGGGACCTG ATATTGGTGA AGTGAAAGAT 
1801 GGAACAGAGT GTGGGATAGA TCATATATGC ATCCACAGGC ACTGTGTCCA 

18 51 TATAACCATC TTGAATAGTA ATTGCTCACC TGCATTTTGT AACAAGAGGG 
1901 GCATCTGCAA CAATAAACAT CACTGCCATT GCAATTATCT GTGGGACCCT 
195 1 CCCAACTGCC TGATAAAAGG CTATGGAGGT AGTGTTGACA GTGGCCCACC 
2001 CCCTAAGAGA AAGAAGAAAA AGAAGTTCTG TTATCTGTGT ATATTGTTGC 
2 05 1 TTATTGTTTT GTTTATTTTA TTATGTTGTC TTTATCGACT TTGTAAAAAA 
2 101 AGTAAACCAA TAAAAAAGCA GCAAGATGTT CAAACTCCAT CTGCAAAAGA 
2 15 3 AGAGGAAAAA ATTCAGCGTC GACCTCATGA GTTACCTCCC CAGAGTCAAC 
22 01 CTTGGGTGAT GCCTTCCCAG AGTCAACCTC CTGTGACGCC TTCCCAGAGT 

22 5 1 CATCCTCGGG TGATGCCTTC TCAGAGTCAA CCTCCTGTGA T3CCTTCCCA 

23 0 1 GAGTCATCCT CAGTTGACGC CTTCCCAGAG TCAACCTCCT GTGATGCCTT 
2 351 CCCAGAGTCA TCCTCAGTTG ACGCCTTCGC AGAGTCAACC TCCTGTGACA 

24 01 CCCTCCCAGA GGCAACCTCA GTTGATGCCT TCCCAGAGTC AACCTCCTGT 
2451 GACGCCCTCC TAG (SEQ ID NO : 9 ) 



Name: SVPH-4a
301


GAGGATCATC 


CTTACATACC 


AAAGGACTGC 


AACTACATGG 


GCTCCGTGAA 


351


AGAGTCTCTG 


GACTCTAAAG 


CTACTATAAG 


CACATGCATG 


GGGGGTCTCC 


401 


GAGGTGTATT 


TAACATTGAT 


GCCAAACATT 


ACCAAATTGA 


GCCCCTCAAG 


451


GCCTCTCCCA 


GTTTTGAACA 


TGTCGTCTAT 


CTCCTGAAGA 


AAGAGCAGTT 


501


TGGGAATCAG 


GTTTGTGGCT 


TAAGTGATGA 


TGAAATAGAA 


TGGCAGATGG 


551 


CCCCTTATGA 


GAATAAGGCG 


AGGCTAAGGG 


ACTTTCCTGG 


ATCCTATAAA 


601 


CACCCAAAGT 


ACTTGGAATT 


GATCCTACTC 


TTTGATCAAA 


G TAG G TAT AG 


651


GTTTGTGAAC 


AACAATCTTT 


CTCAAGTCAT 


ACATGATGCC 


ATTCTTTTGA 


701 


CTGGGATTAT 


GGACACCTAC 


TTTCAAGATG 


TTCGTATGAG 


GATACACTTA 


751


AAGGCTCTTG 


AAGTATGGAC 


AGATTTTAAC 


AAAATACGCG 


TTGGATATCC 


801 


AG AGTTAG CT 


GAAGTTTTAG 


GCAGATTTGT 


AATATATAAA 


AAAAGTGTAT 


851 


TAAATGCTCG 


CCTGTCATCA 


GATTGGGCAC 


ATTTATATCT 


TCAAAGAAAA 


901 


TATAATGATG 


CTCTTGCATG 


GTCGTTTGGA 


AAAGTGTGTT 


CTCTAGAATA 


951 


TGCTGGATCA 


GTGAGTACTT 


TACTAGATAC 


AAATATCCTT 


GCCCCTGCTA 


1001 


CGTGGTGTGC 


TCATGAGCTG 


GGTCATGCTG 


TAGGAATGTC 


ACATGATGAA 


1051 


CAATACTGCC 


AATGTAGGGG 


TAGGCCTAAT 


TGCATCATGG 


GCTCAGGACG 


1101 


CACTGGGTTT 


AGCAATTGCA 


GTTATATCTC 


TTTTTTTAAA 


CATATCTCTT 


1151 


CGGGAGGAAC 


ATGTCTAAAT 


AATATCCCAG 


GACTAGGTTA 


T G T G C T T AA C- 


1201 


AGATGTGGAA 


ACAAAATTGT 


GGAGGACAAT 


GAGGAATGTG 


ATTGTGGTTC 


1251


CACAGAGGAG 


TGTCAGAAAG 


ATCGGTGTTG 


CCAATCAAAT 


TGTAAGTTGC 


1301


AACCAGGTGC 


CAACTGTAGC 


ATTGGACTTT 


GCTGTCATGA 


TTGTCGGTTT 


1351 


CGTCCATCTG 


GATACGTGTG 


TAGGCAGGAA 


GGAAATGAAT 


GTGACCTTGC 


1401 


AGAGTAGTGC 


GACGGGAATT 


CAAGTTCCTG 


CCCAAATGAC 


GTTTATAAGC 


1451 


AGGATGGAAG 


CCCTTGCAAG 


TATGAAGGCC 


GTTGTTTCAG 


GAAGGGGTGC 


1501 


AGATCCAGAT 


ATATGCAGTG 


CCAAAGCATT 


TTTGGACCTG 


ATGCCATGGA 


1551 


GGCTCCTAGT 


GAGTGCTATG 


ATGCAGTTAA 


CTTAATAGGT 


GATCAATTTG 


1601 


GTAACTGTGA 


GAT T A C AG G A 


ATTCGAAATT 


TTAAAAAGTG 


TGAAAGTGCA 


1651 


AATTCAATAT 


GTGGCAGGCT 


ACAGTGTATA 


AATGTTGAAA 


CCATCCCTGA 


1701 


TTTGCCAGAG 


CATACGACTA 


TAATTTCTAC 


TCATTTACAG 


GCAGAAAATC 


1751 


TCATGTGCTG 


GGGCACAGGC 


TATCATCTAT 


CCATGAAACC 


CATGGGAATA 


1801 


CCTGACCTAG 


GTATGATAAA 


TGATGGCACC 


TCCTGTGGAG 


AAGGCCGGGT 


1851 


ATGTTTTAAA 


AAAAATTGCG 


TCAATAGCTC 


AGTCCTGCAG 


TTTGACTGTT 


1901 


TGCCTGAGAA 


ATGCAATACC 


CGGGGTGTTT 


GCAACAACAG 


AAAAAACTGC 


1951 


CACTGCATGT 


ATGGGTGGGC 


ACCTCCATTC 


TGTGAGGAAG 


TGGGGTATGG 


2001


AGGAAGCATT 


GACAGTGGGC 


CTCCAGGACT 


GCTCAGAGGG 


GCGATTCCCT 


2051 


CGTCAATTTG 


GGTTGTGTCC 


ATCATAATGT 


TTCGCCTTAT 


TTTATTAATC 


2101 


CTTTCAGTGG 


TTTTTGTGTT 


TTTCCGGCAA 


GTGATAGGAA 


ACCACTTAAA 


2151 


ACCCAAACAG 


G AAAAAATG C 


CACTATCCA^ 


AGCAAAAACT 


GAACAGGAAG 


2201


AATCTAAAAC 


AAAAACTGTA 


GAGGAAGAAT 


CTAAAACAAA 


AACTGGACAG 


2251


GAAGAATCTG 


AAGCAAAAAC 


TGGACAGGAA 


GAATCTAAAG 


CAAAAACTGG 


2301


AC A GG AAGAA 


TCTAAAGCAA 


ACATTGAAAG 


T AAACG AC CC 


AAAGCAAAGA 


2351


GTGTCAAGAA 


A ( _ AAAAAAAG 


TAA (SEQ ID NO: 10 ) 





Name: SVPH-4b 

1 ATGAGGTCAG TGCAGATCTT CCTCTCCCAA TGCCGTTTGC TCCTTCTACT 
5 1 AGTTCCGACA ATGCTCCTTA AGTCTCTTGG CGAAGATGTA ATTTTTCACC 



I 



,-. 1TACAT 
451 GCCTCTCCCA GTTTTGAACA TGTCGTCTAT CTCCTGAAGA AAGAGCAGTT

501 TGGGAATCAG GTTTGTGGCT TAAGTGATGA TGAAATAGAA TGGCAGATGG 

551 CCGCTTATGA GAATAAGGCG AGGCTAAGGG ACTTTCCTGG ATCCTATAAA 

60 1 CACCCAAAGT ACTTGGAATT GATCCTACTC TTTGATCAAA GTAGGTATAG 

651 GTTTGTGAAC AAGAATCTTT CTCAAGTCAT ACATGATGCC ATTCTTTTGA 

7 01 CTGGGATTAT GGACACCTAC TTTCAAGATG TTCGTATGAG GATACACTTA 
751 AAGGGTCTTG AAGTATGGAC AGATTTTAAG AAAATAGGGG TTGGATATCC 

8 01 AGAGTTAGCT GAAGTTTTAG GCAGATTTGT AATATATAAA AAAAGTGTAT 
8 51 TAAATGCTCG CGTGTCATCA GATTGGGCAC ATTTATATCT TGAAAGAAAA 
90 1 TATAATGATG CTCTTGCATG GTGGTTTGGA AAAGTGTGTT GTCTAGAATA 
95 1 TGGTGGATGA GTGAGTACTT TACTAGATAC AAATATGCTT GCCCCTGCTA 

10 01 CCTGGCCTGC TCATGAGCTG GGTCATGGTG TAGGAATGTC ACATGATGAA 

1051 GAATAGTGCC AATGTAGGGG TAGGCTTAAT TGGATGATGG GCTCAGGACG 

1101 CACTGGGTTT AGCAATTGCA GTTATATCTC TTTTTTTAAA CATATCTCTT 

1151 CGGGAGCAAC ATGTGTAAAT AATATCCCAG GACTAGGTTA TGTGCTTAAG 

12 01 AGATGTGGAA ACAAAATTGT GGAGGACAAT GAGGAATGTG ACTGTGGTTC 

12 51 CACAGAGGAG TGTGAGAAAG ATCGGTGTTG CCAATCAAAT TGTAAGTTGC 

13 01 AACCAGGTGC CAAGTGTAGC ATTGGACTTT GGTGTGATGA TTGTCGGTTT 

13 51 CGTCGATCTG GATACGTGTG TAGGCAGGAA GGAAATGAAT GTGACCTTGC 

14 01 AGAGTACTGC GACGGGAATT CAAGTTCCTG CCCAAATGAG GTTTATAAGC 

14 51 AGGATGGAAC CCGTTGCAAG TATGAAGGCC GTTGTTTCAG GAAGGGGTGC 

15 01 AGATCCAGAT ATATGCAGTG CCAAAGCATT TTTGGAGGTG ATGCCATGGA 

15 51 GGCTCGTAGT GAGTGGTATG ATGCAGTTAA CTTAATAGGT GATCAATTTG 

16 01 GTAACTGTGA GATTACAGGA ATTCGAAATT TTAAAAAGTG TGAAAGTGCA 
1651 AATTCAATAT GTGGCAGGCT ACAGTGTATA AATGTTGAAA CGATCCCTGA 

17 01 TTTGCGAGAG CATACGAGTA TAATTTCTAC TCATTTACAG G C AG AAAATC 

17 51 TCATGTGCTG GGGGACAGGC TATGATCTAT CCATGAAACC CATGGGAATA 
1801 CCTGAGCTAG GTATGATAAA TGATGGCAGC TCCTGTGGAG AAGGCCGGGT 

18 51 ATGTTTTAAA AAAAATTGCG TCAATAGCTC AGTCCTGCAG TTTGACTGTT 
1901 TGCCTGAGAA ATGGAATAGC GGGGGTGTTT GCAACAACAG AAAAAACTGG 
195 1 CAGTGGATGT ATGGGTGGGC ACCTCCATTG TGTGAGGAAG TGGGGTATGG 
2 001 AGGAAGCATT GAGAGTGGGC GTCCAGGACT GCTGAGAGGG GCG ATTCCCT 
2 051 CGTCAATTTG GGTTGTGTCG ATGATAATGT TTCGGCTTAT TTTATTAATG 
2101 CTTTCAGTGG TTTTTGTGTT TTTCCGGCAA GTGATAGGAA ACCAGTTAAA 
2151 ACCCAAACAG GAAAAAATGG CAGTATGGAA AGCAAAAAGT GAACAGGAAG 
22 01 AATCTAAAAC AAAAACTGTA CAGGAAGAAT CTAAAACAAA AACTGGACAG 

22 51 GAAGAATCTG AAGCAAAAAG TGGACAGGAA GAATCTAAAG GAAACATTGA 

23 01 AAGTAAACGA CCCAAAGCAA AGAGTGTCAA GAAACAAAAA AAGTAA 

(SEQ ID NO: 11) 



The amino acid sequences of the polypeptides encoded by the nucleotide
sequences of the invention include:



Name: SVPH-1 (polypeptide) 
Name: SVPH-3 (polypeptide)

1 EDWVYYRI SH EEKDLFFNLT VNEGFLSNSY IMEKRYGNLS HV KM MAS SAP 
51 LCHLSGTVLQ QGTRVGTAAL SACHGLTGFF QLPHGDFF1E PVKKHPLVEG 
101 GYHPHIVYRR Q KV P E T KE P T CGL ( SEQ ID NO : 5 ) 



Name: SVPH-4 (polypeptide) 

1 HEDLYLQRKY NDALAWSFCK VCSLEYAGSV STLLDTNILA PATWSAHELG 

51 HAVGMSHDEO YCQCRGRPNC IMGSGRTGFS NCSYISFFKH 1SSGATCLNN 

101 IPGLGYVLKR CGNKIVEDNE ECDCGSTEEC QKDRCCQSNC KLQPGANCSI 

151 GLCCHDCRFR PSGWCRQEG NECDLAE YCD GNSSSCPNDV YKQDGTPCKY 

201 EGRCFRKGCR SRYMQCQSTF GPDAMEAPSE CYDAVNLIGD QFGNCEITGI 

251 RNFKKCESAN SICGRLQCIN VETI PDLPEH TTIISTHLQA ENLMCWGTGY 

3 01 HLSMKPMCIP DLGMINDGTS CGEGRVCFKK NCVNSSVT.QF DCLPEKCNTR 

3 51 GVCNNRKNCH CMYGWAPPFC EEVGYGGS ID SGPPGLLRGA IPSSIWWSI 

4 01 IMFRLILLIL SWFVFFRQV IGNHLKPKQE KMPLSKAKTE OEESKTKTVQ 
4 51 EESKTKTGQE ESEAKTGQEE SKAKTGQEES KANIESKF.PK AKSVKKQKK* 

(SEQ ID NO : 6 ) 



Name: SVPH-la (polypeptide) 



i 

51 
101 
151 
201 
251 
301 
351 
4 01 
451 
501 
551 
601 
651 
7 01 
7 5 1 



MKMLLLLHCL 
WLSYILPFGG 
YYHGYVEGDP 
KMDSEEKQFS 
EI WVIDNYL 
WTNKNLI WD 
GAFRGMCTPH 
QPRCIMHEGN 
GNGWEEGEE 
SGKVCRKEVN 
RNEOCRRIFG 
CGRIQCENVT 
GTECGIDHI 2 
PJICLI KGYGG 
SKPIKKQQDY 
QPQLMPSQS 2 



GVFLSCSGHI 
QKHIIHIKVK 
ESLVSLSTCF 
TMRSGFMQNE 
YIRYERNDSK 
DVRKSVHLYC 
RSCAI VTFMN 
PPITKFSNCS 
CDCGPLKKCA 
ECDLPEWCNG 
AGANTASETC 
EIPNMSDHTT 
IHRHCVHIT1 
SVDSGPPPKF. 
QTPSAKEEEK 
PPVTPS* : £ 



QDEHPQYHSP 
KLLFSKHLPV 
GGFQGILQIN 
ITCRMEFEEI 
LLEDLWIVN 
KWKSENITPR 
K.TLGTFS 1 AV 
YGDFWEYTVE 
KJDPCCLSNCT 
TSHKCPDDFY 
YKELNTLGDR 
VHWARFND IM 
LNSNCSPAFC 
KKKKKFCYLC 
1 QRRPHELPP 
■DO J Ei NO : I! 



PDWIPWIT 
FTYTDQGAIL 
DFAYEIKPLA 
DNSTQKQSSY 
IVDSILDV1G 
MQHDTSHLFT 
AHHLGHNLGM 
RTKCLLETVH 
LTDGSTCAFG 
VEDGIPCKER 
VGHCGIKNAT 
CWSTDYHLGM 
NKRGICNNKH 
I LiLLIVLF 1 L 
QSQPWYMPSQ 



GTTRGMTPPG 
EDQ P F VQNNC 
FSTTFEHLVY 
VGWWIHFRIV 
VP2VLLFGLEI 
TLGLRGLSGI 
NHDEDTCRCS 
TP2D I FNVKRC 
LCCKDCKFLP 
GYCYEKSCHD 
YIKCNISDVQ 
KGPDIGEVKD 
HCHCNYLWDP 
LCCLYRLCKK 
SQPPVTP3QP 



Name: SVPH-lb (polypeptide) 



1 MKMLLLLHCL GVFLSCSGHI QDEHPQYHSP PDWIFVRIT GTTRGMTPPG 

5 1 WLSYILPFGG QKHIIHIKVK PVLLFSKHLPV FTYTDQGAI L EDQPFVQNNC 

101 YYHGYVEGDP ESLVSLSTCF GGFQGILOIN DFAYEIKPLA FSTTFEHLVY 

15 1 KMDSEEKQFS TMRSGFMQNE 1 TCRMEFEEI DNSTQKQSSY VGWWTHFP T 
551
601 
651 
701 
751 



cgri qcenvt ei pnmsdhtt 
gtecgidhic ihrhcvhiti 
pnclikgygg svdsgpppkf. kkkkkfcylc 
skpikkqqdv qtpsakeeek 



VHWARFNDTM CWSTDYHLGM KGPDIGEVKD 

LNSIJCSPAFC N KR G I C NN KH HCHCNYLWDP 

ILLLIVLFIL LCCLYRLCKK 

IQRRPHELPP QSQPWVMPSQ SQPPVTPSQS 



HPQVMPSQSQ PPQNLFLFSF SISDCVLNFR LLYLQAT* (SEQ ID NO : 1 3 



Name: SVPH-lc (polypeptide) 



51 
10 1 
151 
201 
251 
301 

3 51 

4 01 
4 51 
501 
551 
6 01 
651 
701 
751 
801 



mkmllllhc: 
wlsyilpfgg 
yyhgyvegdp 
kmdseekqfs 

EIW\ r IDNYL 
WTNKNLIWD 
GAFRGMCTPH 
QPRCIMHEGN 
GNGWEEGEE 
S G rTVC R KE VN 

rneqcrrifg 
cgri qcenvt 
gtecgidhic 
pnclikgygg 
skpikkqqdv 
hprvmpsqsq 
psqrqpqlmp 



. GVFLSCSGH1 
QKHI IHIKVK 
ESLVSLSTCF 
TMRSGFMQNE 
YIRYERNDSK 
DVRKSVHLYC 
RSCAIVTFMN 
PPITKFSNCS 
CDCGPLKHC7, 
ECDLPEWCNG 
AGANTASETC 
EI PNMSDHTT 
IHRHCVHITI 
SVDSGPPPKR 
QTPSAKEEEK 
PPVMPSQSHP 
SQSQPPVTPS 



QDEHPQYHS 
KLLFSKHLPV 
GGFQGILQIN 
ITCP.MEFEEI 
LLEDLYV I VN 
KWKSENITPR 
KTLGTFS I AV 
YGDFWEYTVE 
KDPCCLSNCT 
TSHKCPDDFY 

ykelntlgdr 
vhwarfndim 
lnsijcspafc 
kkkkkfcylc 
iqrrphelpp 
qltpsqsqpp 

(SEQ ID 



P PDWIPVRIT GTTRGMTPPG 
FTYTDQGAIL EDQPFVQNNC 
DFAYEI KPLA FSTTFEHLVY 



DNSTQKQSSY 
IVDSILDVJG 
MQHDTSHLFT 
AHHLGHNLGM 
RTKCLLETVH 
LTDGSTCAFG 
VEUGIPCKER 
VGHCG I KNAT 
CWSTDYHLGM 
NKRGICNNKH 
ILLLIVLFIL 
QSQPWVMPSQ 



VGWWIHFRIV 
VKVLLFGLEI 
TLGLRGLSGI 
NHDEDTCRCS 
TKDIFNVKRC 

lcckdckflp 
gycyekschd 
yikcnisdvq 
kgpdigevkd 
hchcnylwdp 
lcclyrlckk 
sqppvtpsqs 



VMPSQSHPQL TPSQSQPPVT 
NO : 14 ) 



Name: SVPH-4a (polypeptide) 



1 


MRSVQIFLSQ 


51 


FRGEVQGWS 


101 


EDHPYI PKDC 


151 


ASPSFEHWY 


201 


HPKYLELILL 


251 


KALEWTDFN 


301 


YNDALAWSFG 


351 


QYCQCRGRPN 


401 


RCGNKIVEDN 


451 


RPSGYVCRQE 


501 


RSRYMQCQSI 




NSICGRLQCI 


60 1 


PDLGMINDGT 


65 1 


HCMYGWAPPF 


701 


LSWFVFFRQ 


75 1 


EESEAKTGQE 



pvs yllqlk.g 
nymgsvkesl 
l.lkkeqfgnq 
fdqsryrfvn 
kirvgypela 
kvcsleyags 
cimgsgrtgf 
eecdcgstee 
gnecdlaevc 
fgpdameaps 
nvetipdlpe 
scgegrvcfk 
ceevgyggsi 
vignhlkpkq 
eskaktgqee 

( SEQ ID NO: 15) 



MLLKSLGEDV 
KKHVLHLWPK 
DSKATISTCM 
VCGLSDDEIE 
NNLSQVIHDA 
EVLGRFVI YK 
VSTLLDTNIL 
SNCSYISFFK 
CQKDRCCQSN 
DGNSSSCPNr 

ecydavnlic 
htt: isthlo 
kncvnssvlq 

DSGPPGLLRG 
EKMPLSKAKT 
SKANIESKRP 



I FHPEGEFDS 
RLLLPRHLRV 
GGLRGVFNID 
WQMAPYENKA 
I LLTG IMDTY 
KSVLNARLSS 
APATWSAHEL 
HISSGATCLN 
CKLQPGANCS 
VYKQDGTPCK 
DQFGNCEITG 
AENLMCWGTG 
FDCLPEKCNT 
AIPSSIWWS 
EQEESKTKTV 
KAKSVKKQKK 



YEVTI PEKLS 
FSFTEHGELL 
AKHYQIEPLK 
RLRDFPGSYK 
FQDVRMRIHL 
DWAHLYLQRK 
GHAVGMSHDE 
NIPGLGYVLK 
I GLCCHDCRF 
YEGRCFRKGC 
IRNFKKCESA 
YHLSMKPMG] 
RGVCNNRKNC 
I IMFRLILL1 
QEESKTKTGO 



Name: SVPH-4b (polypeptide) 
251 KALEVWTDFN KIRVGYPELA EVLGRFVIYK KSVLNARLSS DWAHLYLQRK

301 YNDALAWSFG KVCSLEYAGS VSTLLDTNIL APATWPAHEL GHAVGMSHDE 

351 QYCQCRGRLN CIMGSGRTGF SNCSYISFFK HISSGAT CLN NI PGLGYVLK 

401 RCGNKIVEDN EECDCGSTEE CQKDRCCQSN CKLQPGANCS IGLCCKDCRF 

451 RPSGYVCRQE GNECDLAEYC DGNSSSCPND VYKQDGTPCK YEGRCFRKGC 

501 RSRYMQCQS I FGPDAMEAPS ECYDAVNLIG DQFGNCE ITG IRNFKKCESA 

551 NSICGRLQCI NVET1 PDLPE HTTIISTHLQ AENLMCWGTG YHLSMKPMGI 

601 PDLGMINDGT SCGEGRVCFK KNCVNSSVLQ FDCLPEKCNT RGVCNNRKNC 

651 HCMYGWAPPF CEEVGYGGS I DSGPPGLLRG AIPSSIWWS IIMFRLILLI 

701 LSWFVFFRQ VIGNHLKPKQ EKMPLSKAKT EQEESKTKTV QEESKTKTGQ 

751 EESEAKTGQE ESKANIESKR PKAKSVKKQK K* (SEQ ID NO: 16) 

The discovery of the nucleic acids of the invention enables the construction of
expression vectors comprising nucleic acid sequences encoding polypeptides; host
cells transfected or transformed with the expression vectors; isolated and purified
biologically active polypeptides and fragments thereof; the use of the nucleic acids or
oligonucleotides thereof as probes to identify nucleic acid encoding proteins having
metalloproteinase-disintegrin activity; the use of the nucleic acids or oligonucleotides
thereof to identify human chromosome number 1 or 4; the use of the nucleic acids or 
oligonucleotides thereof to map genes on human chromosome number 1 or 4; the use 
of the nucleic acid or oligonucleotides thereof to identify genes associated with certain 
diseases, syndromes or other human conditions associated with human chromosome 
number 1 or 4, including fetal hydantoin syndrome, diphenylhydantoin toxicity, and 
pheochromocytoma; the use of single-stranded sense or antisense oligonucleotides 
from the nucleic acids to inhibit expression of polynucleotide encoded by the SVPH- 
1, SVPH-3, or SVPH-4 gene; the use of such polypeptides and soluble fragments to 
function as a proteinase; the use of such polypeptides and fragmented peptides as 
molecular weight markers; the use of such polypeptides and fragmented peptides as
controls for peptide fragmentation, and kits comprising these reagents; the use of such 
polypeptides and fragments thereof to generate antibodies; and the use of antibodies to 
purify SVPH polypeptides. 
NUCLEIC ACID MOLECULES

In a particular embodiment, the invention relates to certain isolated nucleotide 
sequences that are free from contaminating endogenous material. A "nucleotide 
sequence" refers to a polynucleotide molecule in the form of a separate fragment or as 
a component of a larger nucleic acid construct. The nucleic acid molecule has been 
derived from DNA or RNA isolated at least once in substantially pure form and in a 
quantity or concentration enabling identification, manipulation, and recovery of its 
component nucleotide sequences by standard biochemical methods, such as those 
outlined in Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd ed., Cold
Spring Harbor Laboratory, Cold Spring Harbor, NY (1989). Such sequences are
preferably provided and/or constructed in the form of an open reading frame 
uninterrupted by internal non-translated sequences, or introns, that are typically 
present in eukaryotic genes. Sequences of non-translated DNA can be present 5' or 3'
from an open reading frame, where the same do not interfere with manipulation or 
expression of the coding region. 

Nucleic acid molecules of the invention include DNA in both single-stranded 
and double-stranded form, as well as the RNA complement thereof. DNA includes, 
for example, cDNA, genomic DNA, chemically synthesized DNA, DNA amplified by 
PCR, and combinations thereof. Genomic DNA may be isolated by conventional 
techniques, e.g., using the cDNA of SEQ ID NOs:1-3 and 7-11, or a suitable fragment
thereof, as a probe. 

The DNA molecules of the invention include full length genes as well as 
polynucleotides and fragments thereof. The full length gene may include the N-
terminal signal peptide. Other embodiments include DNA encoding a soluble form,
e.g., encoding the extracellular domain of the protein, either with or without the signal
peptide. 

The nucleic acids of the invention are preferentially derived from human 
Preferred Sequences

Particularly preferred nucleotide sequences of the invention are SEQ ID
NOs:1-3 and 7-11, as set forth above. The sequences of amino acids encoded by the
DNA of SEQ ID NOs:1-3 and 7-11 are shown in SEQ ID NOs:4-6 and 12-16,
respectively. In SEQ ID NO:1 "N" can represent any nucleotide. These sequences
identify the SVPH polynucleotides as members of the metalloproteinase-disintegrin 
family. As noted above, proteins of this family are characterized by a pro-domain, a 
disintegrin domain, a metalloproteinase domain, a cysteine rich region, a 
transmembrane domain, and a cytoplasmic tail. 

In particular, SVPH-1 (originally isolated from human testis) and SVPH-4 
(originally isolated from human testis, fetal lung, and B-cells) both share homology to 
the cysteine rich region of the metalloproteinase-disintegrin family, and SVPH-3
(originally isolated from human fetus tissue) shares homology to the pro-domain of 
these family members. In addition, SVPH-4 polypeptide (SEQ ID NO:3) encodes a 
zinc binding motif (His 47 to Asp 58), a disintegrin domain (Leu 104 to Cys 179), and 
a cysteine rich region (Asp 180 to Arg 388).

SVPH-1a, SVPH-1b, and SVPH-1c represent the nucleotide sequences (SEQ
ID NOs:7-9) of three alternatively spliced SVPH-1 clones with divergent cytoplasmic 
domains. These clones were isolated by screening a human testis library (Clonetech 
cat no. HL3024a) at 42°C and washing at 42°C in 2x SSC using four different 
oligonucleotides: 

GACCTAAGGTGTTCAATTCTT7G (SEQ ID NO:17),

CAAATACTGCAAGTGAGA'TTGG (SEQ ID NO:18),

TGGAGAAGTAGGTGTGGTGTAGGT' (SEQ ID NO:19), and

GAGCCACTGGAATTGAAAAAGTGCCC (SEQ ID NO:20).

SVPH-4a and SVPH-4b represent the nucleotide sequences (SEQ ID NOs:10-11)
of two alternatively spliced SVPH-4 clones with divergent cytoplasmic domains.

oligonucleotides. 

[oligonucleotide sequence illegible in source] (SEQ ID NO:21),
CTTTCACGGAGCCCATGTAGTTGCAG (SEQ ID NO:22), and
TGAAGGAGAAAACGCGCAGATGTCGG (SEQ ID NO:23). 

Additional Sequences 

Due to the known degeneracy of the genetic code, wherein more than one 
codon can encode the same amino acid, a DNA sequence can vary from that shown in 
SEQ ID NOs:l-3 and 7-11, and still encode a polypeptide having the amino acid 
sequence of SEQ ID NOs:4-6 and 12-16, respectively. Such variant DNA sequences 
can result from silent mutations (e.g., occurring during PCR amplification), or can be
the product of deliberate mutagenesis of a native sequence. 
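
The degeneracy described above can be illustrated with a minimal Python sketch. It assumes the standard genetic code; the helper name `translate` and the variant sequence are hypothetical and chosen only for illustration. The "native" string is the first 18 nucleotides listed above under SVPH-4b, and the variant differs from it only by synonymous codon changes.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding strand codon by codon (hypothetical helper)."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 18 nucleotides of the SVPH-4b listing above.
native = "ATGAGGTCAGTGCAGATC"
# Illustrative variant: AGG->CGT, TCA->AGC, GTG->GTT, CAG->CAA, ATC->ATT.
variant = "ATGCGTAGCGTTCAAATT"

print(translate(native))
print(translate(variant))
```

Both strings translate to the same six residues even though they differ at several nucleotide positions, which is the sense in which a variant DNA sequence can still encode the same polypeptide.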

The invention thus provides isolated DNA sequences encoding polypeptides of 
the invention, selected from: (a) DNA comprising the nucleotide sequence of SEQ ID 
NOs:l-3 and 7-11; (b) DNA encoding the polypeptides of SEQ ID NOs:4-6 and 12- 
16; (c) DNA capable of hybridization to a DNA of (a) or (b) under conditions of 
moderate stringency and which encodes polypeptides of the invention; (d) DNA 
capable of hybridization to a DNA of (a) or (b) under conditions of high stringency 
and which encodes polypeptides of the invention, and (e) DNA which is degenerate as 
a result of the genetic code to a DNA defined in (a), (b), (c), or (d) and which encodes
polypeptides of the invention. Of course, polypeptides encoded by such DNA 
sequences are encompassed by the invention. 

As used herein, conditions of moderate stringency can be readily determined 
by those having ordinary skill in the art based on, for example, the length of the DNA. 
The basic conditions are set forth by Sambrook et al., Molecular Cloning: A
Laboratory Manual, 2nd ed., Vol. 1, pp. 1.101-104, Cold Spring Harbor Laboratory
Press (1989), and include use of a prewashing solution for the nitrocellulose filters
5X SSC, 0.5% SDS, 1.0 mM EDTA (pH 8.0), hybridization conditions of about 50% 
formamide, 6X SSC at about 42 °C (or other similar hybridization solution, such as 

[text illegible in source] based on, for example, the length of the DNA.
Generally, such conditions are defined as hybridization conditions as above, and with
washing at approximately 68°C, 0.2X SSC, 0.1% SDS. The skilled artisan will
recognize that the temperature and wash solution salt concentration can be adjusted as 
necessary according to factors such as the length of the probe. 

Also included as an embodiment of the invention is DNA encoding 
polypeptide fragments and polypeptides comprising inactivated N-glycosylation 
site(s), inactivated protease processing site(s), or conservative amino acid 
substitution(s), as described below.

In another embodiment, the nucleic acid molecules of the invention also 
comprise nucleotide sequences that are at least 80% identical to a native sequence. 
Also contemplated are embodiments in which a nucleic acid molecule comprises a 
sequence that is at least 90% identical, at least 95% identical, at least 98% identical, at 
least 99% identical, or at least 99.9% identical to a native sequence.

The percent identity may be determined by visual inspection and mathematical 
calculation. Alternatively, the percent identity of two nucleic acid sequences can be 
determined by comparing sequence information using the GAP computer program, 
version 6.0 described by Devereux et al., Nucl. Acids Res., 12:387 (1984) and
available from the University of Wisconsin Genetics Computer Group (UWGCG). 
The preferred default parameters for the GAP program include: (1) a unary
comparison matrix (containing a value of 1 for identities and 0 for non-identities) for 
nucleotides, and the weighted comparison matrix of Gribskov and Burgess, Nucl.
Acids Res. 14:6745 (1986), as described by Schwartz and Dayhoff, eds., Atlas of
Protein Sequence and Structure. National Biomedical Research Foundation, pp. 353- 
358 (1979); (2) a penalty of 3.0 for each gap and an additional 0.10 penalty for each
symbol in each gap; and (3) no penalty for end gaps. Other programs used by one 
skilled in the art of sequence comparison may also be used. 
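
As a rough illustration of the quantities named above, the following Python sketch scores a pair of already-aligned nucleotide sequences with a unary matrix (1 for identities, 0 for non-identities) and computes the stated gap penalty of 3.0 per gap plus 0.10 per gap symbol. It is not the GAP program and performs no alignment; the function names are hypothetical, and the convention of dividing by the number of gap-free aligned columns is an assumption, since the text does not fix the denominator.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over gap-free columns of two pre-aligned sequences.

    Assumes both strings have equal length and use '-' for gap symbols.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Keep only columns in which neither sequence has a gap.
    columns = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x != "-" and y != "-"
    ]
    if not columns:
        return 0.0
    # Unary comparison matrix: 1 for an identity, 0 for a non-identity.
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def gap_penalty(gap_length: int, per_gap: float = 3.0, per_symbol: float = 0.10) -> float:
    """Penalty for one gap: 3.0 per gap plus 0.10 for each symbol in it."""
    return per_gap + per_symbol * gap_length


print(percent_identity("ACGT-A", "ACCTTA"))
print(gap_penalty(3))
```

Under this convention a sequence that is "at least 80% identical" to a native sequence is one for which `percent_identity` returns 80.0 or more against the best alignment found by whichever program is used.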

The invention also provides isolated nucleic acids useful in the production of 
polypeptides. Such polypeptides may be prepared by any of a number of conventional 



suitable leader or signal peptide. Alternatively, the desired fragment may be
chemically synthesized using known techniques. DNA fragments also may be
produced by restriction endonuclease digestion of a full length cloned DNA sequence, 
and isolated by electrophoresis on agarose gels. If necessary, oligonucleotides that 
reconstruct the 5' or 3' terminus to a desired point may be ligated to a DNA fragment 
generated by restriction enzyme digestion. Such oligonucleotides may additionally 
contain a restriction endonuclease cleavage site upstream of the desired coding 
sequence, and position an initiation codon (ATG) at the N-terminus of the coding 
sequence. 

The well-known polymerase chain reaction (PCR) procedure also may be 
employed to isolate and amplify a DNA sequence encoding a desired protein 
fragment. Oligonucleotides that define the desired termini of the DNA fragment are 
employed as 5' and 3' primers. The oligonucleotides may additionally contain 
recognition sites for restriction endonucleases, to facilitate insertion of the amplified 
DNA fragment into an expression vector. PCR techniques are described in Saiki et
al., Science 239:487 (1988); Recombinant DNA Methodology, Wu et al., eds.,
Academic Press, Inc., San Diego, pp. 189-196 (1989); and PCR Protocols: A Guide
to Methods and Applications, Innis et al., eds., Academic Press, Inc. (1990).
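
The way a pair of oligonucleotides defines the termini of an amplified fragment can be sketched as follows. This is a simplified illustration only: the function names, the 0-based coordinates, and the primer length are assumptions, and the sketch ignores melting temperature, secondary structure, and any added restriction sites. The example template is the first 30 nucleotides listed above under SVPH-4b.

```python
# Map each base to its Watson-Crick complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def design_primers(template: str, start: int, end: int, primer_len: int = 20):
    """Return (5' primer, 3' primer) flanking template[start:end].

    The 5' primer copies the start of the region on the coding strand;
    the 3' primer is the reverse complement of the end of the region.
    """
    region = template.upper()[start:end]
    if len(region) < primer_len:
        raise ValueError("region is shorter than the requested primer length")
    forward = region[:primer_len]
    reverse = reverse_complement(region[-primer_len:])
    return forward, reverse


template = "ATGAGGTCAGTGCAGATCTTCCTCTCCCAA"
forward, reverse = design_primers(template, 0, 30, primer_len=6)
print(forward, reverse)
```

A restriction endonuclease recognition site, where wanted, would be prepended to the 5' end of each primer so that it does not disturb annealing at the 3' end.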

POLYPEPTIDES AND FRAGMENTS THEREOF 

The invention encompasses polypeptides and fragments thereof in various 
forms, including those that are naturally occurring or produced through various 
techniques such as procedures involving recombinant DNA technology. Such forms 
include, but are not limited to, derivatives, variants, and oligomers, as well as fusion 
proteins or fragments thereof. 

Polypeptides and Fragments Thereof 

The polypeptides of the invention include the proteins encoded by the nucleic 
acid sequences set forth above. Particularly preferred polypeptides comprise the 

SEQ ID NO:5 includes a pro-domain homologous to the same family of proteins.
SVPH-1 (SEQ ID NO:4) has an N-terminal region having amino acids Met 1 to Asn
40. In SEQ ID NO:4 "X" can represent any amino acid. SVPH-3 (SEQ ID NO:5) has 
an N-terminal region having amino acids Asn 1 to Leu 23. SVPH-4 (SEQ ID NO:6) 
also includes an extracellular domain comprising amino acids His 1 to Arg 388, a
transmembrane region comprising amino acids Gly 389 through Phe 417, and a C- 
terminal cytoplasmic domain comprising amino acids Arg 418 to Lys 499 and is 
believed to overlap with EST designated AA 782936. 

The SVPH-1a polypeptide (SEQ ID NO:12), SVPH-1b polypeptide (SEQ ID
NO:13), and SVPH-1c polypeptide (SEQ ID NO:14) each comprises a signal sequence
(Met 1 to Ser 15), a pro-domain (Cys 16 to Ser 188), a catalytic domain (Ser 189 to
Thr 388), a disintegrin domain (Val 389 to Gly 491), a cysteine rich region (Tyr 492
to Lys 675), and a transmembrane domain (Phe 676 to Cys 698). In addition, each of
the SVPH-1a, SVPH-1b, and SVPH-1c polypeptides (SEQ ID NOs:12-14) comprises a
cytoplasmic domain. Due to alternative splicing, the cytoplasmic domain of each
polypeptide is different. For SVPH-1a, SVPH-1b, and SVPH-1c the cytoplasmic
domains are (Lys 699 to Ser 766), (Lys 699 to Thr 787), and (Lys 699 to Ser 820),
respectively.

Similarly, the SVPH-4a polypeptide (SEQ ID NO:15) and SVPH-4b
polypeptide (SEQ ID NO:16) each includes a signal sequence (Met 1 to Gly 27), a
pro-domain (Glu 28 to Arg 193), a catalytic domain (Asp 194 to Ile 392), a disintegrin
domain (Pro 393 to Gly 493), a cysteine rich region (Arg 494 to Ser 685), and a
transmembrane domain (Ile 686 to Gly 713). In addition, each of the SVPH-4a and
SVPH-4b polypeptides (SEQ ID NOs:15-16) includes a cytoplasmic domain. Due to
alternative splicing, the cytoplasmic domain of each polypeptide is different. The
cytoplasmic domain of SVPH-4a is (Asn 714 to Lys 790), and the cytoplasmic
domain of SVPH-4b is (Asn 714 to Lys 781).
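
The domain boundaries recited above for SVPH-1a can be held in a small table and checked for consistency. The following is only an illustrative sketch: the residue numbers are those given in the text, while the variable and function names are hypothetical and form no part of the disclosure.

```python
# Domain boundaries of SVPH-1a (SEQ ID NO:12) as recited in the text.
# Names of variables and helpers are illustrative assumptions.
SVPH_1A_DOMAINS = [
    ("signal sequence", 1, 15),
    ("pro-domain", 16, 188),
    ("catalytic domain", 189, 388),
    ("disintegrin domain", 389, 491),
    ("cysteine rich region", 492, 675),
    ("transmembrane domain", 676, 698),
    ("cytoplasmic domain", 699, 766),
]


def domain_lengths(domains):
    """Return a mapping of domain name to length in residues."""
    return {name: end - start + 1 for name, start, end in domains}


def is_contiguous(domains):
    """True if each domain starts one residue after the previous one ends."""
    return all(
        nxt[1] == cur[2] + 1 for cur, nxt in zip(domains, domains[1:])
    )


def region_before_transmembrane(domains):
    """Residue range (first, last) lying N-terminal to the transmembrane domain."""
    tm_start = next(s for name, s, _ in domains if name == "transmembrane domain")
    return (domains[0][1], tm_start - 1)
```

Under these boundaries, the region preceding the transmembrane domain spans residues 1 to 675, which is the portion from which a soluble form lacking the transmembrane region would be drawn.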

The skilled artisan will recognize that the above-described boundaries of such 



The polypeptides of the invention may be membrane bound or they may be

the cells in which they are expressed. In general, soluble polypeptides may be
identified (and distinguished from non-soluble membrane-bound counterparts) by
separating intact cells which express the desired polypeptide from the culture medium,
e.g., by centrifugation, and assaying the medium (supernatant) for the presence of the
desired polypeptide. The presence of polypeptide in the medium indicates that the
polypeptide was secreted from the cells and thus is a soluble form of the protein.

In one embodiment, the soluble polypeptides and fragments thereof comprise 
all or part of the extracellular domain, but lack the transmembrane region that would 
cause retention of the polypeptide on a cell membrane. A soluble polypeptide may 
include the cytoplasmic domain, or a portion thereof, as long as the polypeptide is 
secreted from the cell in which it is produced. 

In general, the use of soluble forms is advantageous for certain applications. 
Purification of the polypeptides from recombinant host cells is facilitated, since the 
soluble polypeptides are secreted from the cells. Further, soluble polypeptides are 
generally more suitable for intravenous administration. 

The invention also provides polypeptides and fragments of the extracellular 
domain that retain a desired biological activity. Particular embodiments are directed 
to polypeptide fragments that retain the ability to bind the "binding partner" or the 
native cognates, substrates, or counter-structure. Such a fragment may be a soluble 
polypeptide, as described above. In another embodiment, the polypeptides and 
fragments advantageously include regions that are conserved in the SVPH family as 
described above. 

Also provided herein are polypeptide fragments comprising at least 20, or at 
least 30, contiguous amino acids of the sequences of SEQ ID NOs:4-6 and 12-16. 
Fragments derived from the cytoplasmic domain find use in studies of signal 
transduction, and in regulating cellular processes associated with transduction of
biological signals. Polypeptide fragments also may be employed as immunogens, in

Naturally occurring variants as well as derived variants of the polypeptides and

Variants may exhibit amino acid sequences that are at least 80% identical.
Also contemplated are embodiments in which a polypeptide or fragment comprises an
amino acid sequence that is at least 90% identical, at least 95% identical, at least 98%
identical, at least 99% identical, or at least 99.9% identical to the preferred
polypeptide or fragment thereof. Percent identity may be determined by visual
inspection and mathematical calculation. Alternatively, the percent identity of two
protein sequences can be determined by comparing sequence information using the
GAP computer program, based on the algorithm of Needleman and Wunsch, J. Mol.
Biol., 48:443 (1970) and available from the University of Wisconsin Genetics
Computer Group (UWGCG). The preferred default parameters for the GAP program
include: (1) a scoring matrix, blosum62, as described by Henikoff and Henikoff,
Proc. Natl. Acad. Sci. USA, 89:10915 (1992); (2) a gap weight of 12; (3) a gap
length weight of 4; and (4) no penalty for end gaps. Other programs used by one
skilled in the art of sequence comparison may also be used.
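
By way of illustration only, a percent identity calculation based on a Needleman and Wunsch style global alignment can be sketched as follows. This sketch is not the GAP program: it assumes a simple match/mismatch score and a linear gap penalty in place of the blosum62 matrix and the gap weights recited above, and it assumes that percent identity is taken over aligned (ungapped) columns. All function names and parameter values are hypothetical.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Global alignment of two sequences; returns the two gapped strings.

    Scoring is a simplified assumption (match/mismatch, linear gap),
    not the blosum62 / gap-weight scheme of the GAP program.
    """
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + sub,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,      # gap in b
                score[i][j - 1] + gap,      # gap in a
            )
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = match if a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a, b):
    """Identical residues as a percentage of aligned (ungapped) columns."""
    al_a, al_b = needleman_wunsch(a, b)
    columns = [(x, y) for x, y in zip(al_a, al_b) if x != "-" and y != "-"]
    if not columns:
        return 0.0
    return 100.0 * sum(x == y for x, y in columns) / len(columns)
```

A variant could then be compared against a threshold such as 80% by testing `percent_identity(native, variant) >= 80`.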

The variants of the invention include, for example, those that result from 
alternate mRNA splicing events or from proteolytic cleavage. Alternate splicing of 
mRNA may, for example, yield a truncated but biologically active protein, such as a 
naturally occurring soluble form of the protein. Variations attributable to proteolysis 
include, for example, differences in the N- or C-termini upon expression in different 
types of host cells, due to proteolytic removal of one or more terminal amino acids 
from the protein (generally from 1-5 terminal amino acids). Proteins in which 
differences in amino acid sequence are attributable to genetic polymorphism (allelic 
variation among individuals producing the protein) are also contemplated herein. 

Additional variants within the scope of the invention include polypeptides that 
may be modified to create derivatives thereof by forming covalent or aggregative 
conjugates with other chemical moieties, such as glycosyl groups, lipids, phosphate,
acetyl groups and the like. Covalent derivatives may be prepared by linking the

detail below.
Other derivatives include covalent or aggregative conjugates of the
polypeptides with other proteins or polypeptides, such as by synthesis in recombinant
culture as N-terminal or C-terminal fusions. Examples of fusion proteins are
discussed below in connection with oligomers. Further, fusion proteins can comprise
peptides added to facilitate purification and identification. Such peptides include, for
example, poly-His or the antigenic identification peptides described in U.S. Patent No.
5,011,912 and in Hopp et al., Bio/Technology 6:1204 (1988). One such peptide is the
FLAG® peptide, Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys (SEQ ID NO:28), which is
highly antigenic and provides an epitope reversibly bound by a specific monoclonal
antibody, enabling rapid assay and facile purification of expressed recombinant
protein. A murine hybridoma designated 4E11 produces a monoclonal antibody that
binds the FLAG® peptide in the presence of certain divalent metal cations, as
described in U.S. Patent 5,011,912, hereby incorporated by reference. The 4E11
hybridoma cell line has been deposited with the American Type Culture Collection
under accession no. HB 9259. Monoclonal antibodies that bind the FLAG® peptide
are available from Eastman Kodak Co., Scientific Imaging Systems Division, New
Haven, Connecticut.
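
For convenience in handling peptide sequences written in three-letter code, such as the eight-residue tag recited above, a conversion to one-letter code can be sketched as below. The lookup table is the standard amino acid code; the function name is a hypothetical illustration.

```python
# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(peptide):
    """Convert 'Asp-Tyr-Lys' or 'Asp Tyr Lys' style text to one-letter code."""
    tokens = peptide.replace("-", " ").split()
    return "".join(THREE_TO_ONE[token.capitalize()] for token in tokens)


tag = to_one_letter("Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys")
```

An unrecognized token raises `KeyError`, which is a simple way to catch misspelled residue names in a transcribed sequence.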

Among the variant polypeptides provided herein are variants of native 
polypeptides that retain the native biological activity or the substantial equivalent 
thereof. One example is a variant that binds with essentially the same binding affinity
as does the native form. Binding affinity can be measured by conventional 
procedures, e.g., as described in U.S. Patent No. 5,512,457 and as set forth below. 

Variants include polypeptides that are substantially homologous to the native 
form, but which have an amino acid sequence different from that of the native form 
because of one or more deletions, insertions or substitutions. Particular embodiments 
include, but are not limited to, polypeptides that comprise from one to ten deletions, 
insertions or substitutions of amino acid residues, when compared to a native 
Glu and Asp, or Gln and Asn; or substitutions of one aromatic residue for another,
such as Phe, Trp, or Tyr for one another. Other conservative substitutions, e.g.,
involving substitutions of entire regions having similar hydrophobicity characteristics,
are well known.

Similarly, the DNAs of the invention include variants that differ from a native 
DNA sequence because of one or more deletions, insertions or substitutions, but that 
encode a biologically active polypeptide. 

The invention further includes polypeptides of the invention with or without 
associated native-pattern glycosylation. Polypeptides expressed in yeast or 
mammalian expression systems (e.g., COS-1 or COS-7 cells) can be similar to or 
significantly different from a native polypeptide in molecular weight and 
glycosylation pattern, depending upon the choice of expression system. Expression of 
polypeptides of the invention in bacterial expression systems, such as E. coli, provides 
non-glycosylated molecules. Further, a given preparation may include multiple 
differentially glycosylated species of the protein. Glycosyl groups can be removed 
through conventional methods, in particular those utilizing glycopeptidase. In 
general, glycosylated polypeptides of the invention can be incubated with a molar 
excess of glycopeptidase (Boehringer Mannheim). 

Correspondingly, similar DNA constructs that encode various additions or
substitutions of amino acid residues or sequences, or deletions of terminal or internal
residues or sequences are encompassed by the invention. For example, N-
glycosylation sites in the polypeptide extracellular domain can be modified to
preclude glycosylation, allowing expression of a reduced carbohydrate analog in
mammalian and yeast expression systems. N-glycosylation sites in eukaryotic
polypeptides are characterized by an amino acid triplet Asn-X-Y, wherein X is any
amino acid except Pro and Y is Ser or Thr. Appropriate substitutions, additions, or
deletions to the nucleotide sequence encoding these triplets will result in prevention of

sufficient to inactivate an N-glycosylation site. Alternatively, the Ser or Thr can be
replaced with another amino acid, such as Ala. Known procedures for inactivating N-
glycosylation sites in proteins include those described in U.S. Patent 5,071,972 and
EP 276,846, hereby incorporated by reference.
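
The Asn-X-Y triplet described above (X being any residue except Pro, Y being Ser or Thr) lends itself to a simple sequence scan. The following sketch, which operates on a one-letter protein sequence, locates such triplets and replaces the Ser or Thr with Ala. The example sequence and the function names are hypothetical; the cited patents describe the actual mutagenesis procedures.

```python
import re

# Asn, then any residue except Pro, then Ser or Thr. The lookahead lets
# overlapping triplets be reported.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_n_glycosylation_sites(protein):
    """Return 1-based positions of the Asn in each Asn-X-Ser/Thr triplet."""
    return [match.start() + 1 for match in SEQUON.finditer(protein)]


def inactivate_sites(protein):
    """Replace the Ser/Thr of each triplet with Ala (one option named in the text)."""
    residues = list(protein)
    for asn_pos in find_n_glycosylation_sites(protein):
        residues[asn_pos + 1] = "A"  # 0-based index of the Ser/Thr
    return "".join(residues)


example = "MNGSAANPTLNLT"  # hypothetical sequence for illustration
sites = find_n_glycosylation_sites(example)
mutated = inactivate_sites(example)
```

In this hypothetical sequence the triplet Asn-Pro-Thr is skipped because X is Pro, consistent with the definition in the text.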

In another example of variants, sequences encoding Cys residues that are not 
essential for biological activity can be altered to cause the Cys residues to be deleted 
or replaced with other amino acids, preventing formation of incorrect intramolecular 
disulfide bridges upon folding or renaturation. 

Other variants are prepared by modification of adjacent dibasic amino acid 
residues, to enhance expression in yeast systems in which KEX2 protease activity is 
present. EP 212,914 discloses the use of site-specific mutagenesis to inactivate KEX2 
protease processing sites in a protein. KEX2 protease processing sites are inactivated
by deleting, adding or substituting residues to alter Arg-Arg, Arg-Lys, and Lys-Arg 
pairs to eliminate the occurrence of these adjacent basic residues. Lys-Lys pairings 
are considerably less susceptible to KEX2 cleavage, and conversion of Arg-Lys or 
Lys-Arg to Lys-Lys represents a conservative and preferred approach to inactivating 
KEX2 sites. 
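
The dibasic pairs named above can be located and, where they contain a Lys, converted to Lys-Lys by a simple scan of a one-letter protein sequence. This is a minimal sketch under stated assumptions: the example sequence and function names are hypothetical, and Arg-Arg pairs are only reported, not altered, since the text does not name a single preferred replacement for them.

```python
KEX2_PAIRS = ("RR", "RK", "KR")


def find_kex2_sites(protein):
    """Return (1-based position, pair) for each Arg-Arg, Arg-Lys or Lys-Arg pair."""
    return [
        (i + 1, protein[i:i + 2])
        for i in range(len(protein) - 1)
        if protein[i:i + 2] in KEX2_PAIRS
    ]


def convert_to_lys_lys(protein):
    """Replace the Arg of every Arg-Lys or Lys-Arg pair with Lys.

    The scan repeats until no such pair remains, because changing one
    residue can expose a new Arg-Lys pair next to it (e.g. RRK -> RKK -> KKK).
    """
    residues = list(protein)
    changed = True
    while changed:
        changed = False
        for i in range(len(residues) - 1):
            pair = residues[i] + residues[i + 1]
            if pair == "RK":
                residues[i] = "K"
                changed = True
            elif pair == "KR":
                residues[i + 1] = "K"
                changed = True
    return "".join(residues)


example = "AKRDLRKGRRS"  # hypothetical sequence for illustration
before = find_kex2_sites(example)
after = convert_to_lys_lys(example)
```

The loop terminates because every change removes one Arg from the sequence.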



Encompassed by the invention are oligomers or fusion proteins that contain
SVPH polypeptides. When the polypeptide of the invention is a type I membrane
protein, such as SVPH, the fusion partner is linked to the C terminus of the type I
membrane protein. Such oligomers may be in the form of covalently-linked or non-
covalently-linked multimers, including dimers, trimers, or higher oligomers. As noted
above, preferred polypeptides are soluble and thus these oligomers may comprise
soluble polypeptides. In one aspect of the invention, the oligomers maintain the
binding ability of the polypeptide components and thereby provide bivalent,
trivalent, etc., binding sites.

One embodiment of the invention is directed to oligomers comprising multiple 

peptides that have the property of promoting oligomerization. Leucine zippers and
certain polypeptides derived from antibodies are among the peptides that can promote
oligomerization of the polypeptides attached thereto, as described in more detail
below.



Immunoglobulin-based Oligomers



As one alternative, an oligomer is prepared using polypeptides derived from
immunoglobulins. Preparation of fusion proteins comprising certain heterologous
polypeptides fused to various portions of antibody-derived polypeptides (including the
Fc domain) has been described, e.g., by Ashkenazi et al., PNAS USA, 88:10535
(1991); Byrn et al., Nature, 344:677 (1990); and Hollenbaugh and Aruffo,
"Construction of Immunoglobulin Fusion Proteins", Current Protocols in
Immunology, Suppl. 4, pages 10.19.1 - 10.19.11 (1992).

One embodiment of the present invention is directed to a dimer comprising 
two fusion proteins created by fusing a polypeptide of the invention to an Fc 
polypeptide derived from an antibody. A gene fusion encoding the polypeptide/Fc 
fusion protein is inserted into an appropriate expression vector. Polypeptide/Fc fusion
proteins are expressed in host cells transformed with the recombinant expression 
vector, and allowed to assemble much like antibody molecules, whereupon interchain 
disulfide bonds form between the Fc moieties to yield divalent molecules. 

The term "Fc polypeptide" as used herein includes native and mutein forms of 
polypeptides made up of the Fc region of an antibody comprising any or all of the CH 
domains of the Fc region. Truncated forms of such polypeptides containing the hinge 
region that promotes dimerization are also included. Preferred polypeptides comprise 
an Fc polypeptide derived from a human IgG1 antibody.

One suitable Fc polypeptide, described in PCT application WO 93/10151
(hereby incorporated by reference), is a single chain polypeptide extending from the
N-terminal hinge region to the native C-terminus of the Fc region of a human IgG1
antibody. Another useful Fc polypeptide is the Fc mutein described in U.S. Patent
5,457,035 and in Baum et al., EMBO J. 13:3992-4001 (1994), incorporated herein by

The above-described fusion proteins comprising Fc moieties (and oligomers
formed therefrom) offer the advantage of facile purification by affinity
chromatography over Protein A or Protein G columns.

In other embodiments, the polypeptides of the invention may be substituted for 
the variable portion of an antibody heavy or light chain. If fusion proteins are made 
with both heavy and light chains of an antibody, it is possible to form an oligomer 
with as many as four SVPH extracellular regions. 



Alternatively, the oligomer is a fusion protein comprising multiple 
polypeptides, with or without peptide linkers (spacer peptides). Among the suitable 
peptide linkers are those described in U.S. Patents 4,751,180 and 4,935,233, which are 
hereby incorporated by reference. A DNA sequence encoding a desired peptide linker 
may be inserted between, and in the same reading frame as, the DNA sequences of the 
invention, using any suitable conventional technique. For example, a chemically 
synthesized oligonucleotide encoding the linker may be ligated between the 
sequences. In particular embodiments, a fusion protein comprises from two to four 
soluble SVPH polypeptides, separated by peptide linkers. 
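
The requirement that the linker be inserted in the same reading frame as the flanking DNA sequences can be illustrated with a short sketch. Everything specific here is an assumption made for illustration: the linker sequence encodes a hypothetical Gly-Gly-Gly-Gly-Ser spacer and is not taken from the cited patents, the coding sequences are toy examples, and the function names are invented.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codons(dna):
    """Split a DNA string into consecutive triplets."""
    return [dna[i:i + 3] for i in range(0, len(dna), 3)]


def fuse_in_frame(coding_seqs, linker):
    """Join coding sequences with a linker, keeping a single reading frame.

    Every piece must be a whole number of codons; a stop codon is dropped
    from the end of every sequence except the last, and internal stop
    codons are rejected.
    """
    linker = linker.upper()
    if len(linker) % 3 or any(c in STOP_CODONS for c in codons(linker)):
        raise ValueError("linker must be whole codons without a stop codon")
    pieces = []
    for index, seq in enumerate(coding_seqs):
        seq = seq.upper()
        if len(seq) % 3:
            raise ValueError("coding sequence is not a whole number of codons")
        triplets = codons(seq)
        is_last = index == len(coding_seqs) - 1
        if not is_last and triplets and triplets[-1] in STOP_CODONS:
            triplets = triplets[:-1]  # remove the upstream stop codon
        internal = triplets[:-1] if is_last else triplets
        if any(c in STOP_CODONS for c in internal):
            raise ValueError("internal stop codon would truncate the fusion")
        pieces.append("".join(triplets))
    return linker.join(pieces)


# Hypothetical linker: GGT GGC GGT GGC TCT encodes Gly-Gly-Gly-Gly-Ser.
GLY4_SER = "GGTGGCGGTGGCTCT"
fusion = fuse_in_frame(["ATGGCTTAA", "GATTACAAATGA"], GLY4_SER)
```

Because each piece is a multiple of three nucleotides and the upstream stop codon is removed, translation reads through the linker into the downstream polypeptide.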



Another method for preparing the oligomers of the invention involves use of a
leucine zipper. Leucine zipper domains are peptides that promote oligomerization of
the proteins in which they are found. Leucine zippers were originally identified in
several DNA-binding proteins (Landschulz et al., Science, 240:1759 (1988)), and have
since been found in a variety of different proteins. Among the known leucine zippers
are naturally occurring peptides and derivatives thereof that dimerize or trimerize.

The zipper domain (also referred to herein as an oligomerizing, or oligomer-

product of the murine proto-oncogene, c-myc (Landschulz et al., Science 240:1759
(1988)). The zipper domains of fos and jun preferentially form heterodimers (O'Shea
et al., Science 245:646 (1989); Turner and Tjian, Science, 243:1689 (1989)). The
zipper domain is necessary for biological activity (DNA binding) in these proteins.

The fusogenic proteins of several different viruses, including paramyxovirus,
coronavirus, measles virus and many retroviruses, also possess zipper domains
(Buckland and Wild, Nature 338:547 (1989); Britton, Nature, 353:394 (1991);
Delwart and Mosialos, AIDS Research and Human Retroviruses 6:703 (1990)). The
zipper domains in these fusogenic viral proteins are near the transmembrane region of
the proteins; it has been suggested that the zipper domains could contribute to the
oligomeric structure of the fusogenic proteins. Oligomerization of fusogenic viral
proteins is involved in fusion pore formation (Spruce et al., Proc. Natl. Acad. Sci.
U.S.A., 88:3523 (1991)). Zipper domains have also been recently reported to play a
role in oligomerization of heat-shock transcription factors (Rabindran et al., Science
259:230 (1993)).

Zipper domains fold as short, parallel coiled coils (O'Shea et al., Science
254:539 (1991)). The general architecture of the parallel coiled coil has been well
characterized, with a "knobs-into-holes" packing as proposed by Crick, Acta
Crystallogr., 6:689 (1953). The dimer formed by a zipper domain is stabilized by the
heptad repeat, designated (abcdefg)n according to the notation of McLachlan and
Stewart, J. Mol. Biol., 98:293 (1975), in which residues a and d are generally
hydrophobic residues, with d being a leucine, which line up on the same face of a
helix. Oppositely-charged residues commonly occur at positions g and e. Thus, in a
parallel coiled coil formed from two helical zipper domains, the "knobs" formed by
the hydrophobic side chains of the first helix are packed into the "holes" formed
between the side chains of the second helix.
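
The (abcdefg)n notation can be made concrete by assigning a heptad position to each residue of a peptide and listing the residues that fall at the a and d positions. The sketch below is illustrative only: the 14-residue peptide is hypothetical, and the register of the first residue is an input that must be known or assumed for any real zipper.

```python
HEPTAD = "abcdefg"


def heptad_positions(peptide, first="a"):
    """Pair each residue with its heptad position, given the first residue's position."""
    start = HEPTAD.index(first)
    return [(res, HEPTAD[(start + i) % 7]) for i, res in enumerate(peptide)]


def core_residues(peptide, first="a"):
    """Return (1-based index, residue, position) for residues at positions a and d."""
    return [
        (i + 1, res, pos)
        for i, (res, pos) in enumerate(heptad_positions(peptide, first))
        if pos in "ad"
    ]


# Hypothetical two-heptad peptide, assumed to begin at position a.
core = core_residues("IAALKEEIAALKEE", first="a")
```

For this hypothetical peptide the a positions carry Ile and the d positions carry Leu, the pattern of hydrophobic residues that lines up on one face of the helix.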

The residues at position d (often leucine) contribute large hydrophobic

studies also indicate that electrostatic interactions contribute to the stoichiometry and
geometry of coiled coils. Further discussion of the structure of leucine zippers is
found in Harbury et al., Science 262:1401 (1993).

Examples of leucine zipper domains suitable for producing soluble oligomeric 
proteins are described in PCT application WO 94/10308, as well as the leucine zipper 
derived from lung surfactant protein D (SPD) described in Hoppe et al., FEBS Letters, 
344:191 (1994), hereby incorporated by reference. The use of a modified leucine 
zipper that allows for stable trimerization of a heterologous protein fused thereto is 
described in Fanslow et al., Semin. Immunol., 6:267-278 (1994). Recombinant fusion 
proteins comprising a soluble polypeptide fused to a leucine zipper peptide are 
expressed in suitable host cells, and the soluble oligomer that forms is recovered from 
the culture supernatant. 

Certain leucine zipper moieties preferentially form trimers. One example is a
leucine zipper derived from lung surfactant protein D (SPD) noted above, as described
in Hoppe et al., FEBS Letters, 344:191 (1994) and in U.S. Patent 5,716,805, hereby
incorporated by reference in their entirety. This lung SPD-derived leucine zipper
peptide comprises the amino acid sequence Pro Asp Val Ala Ser Leu Arg Gln Gln Val
Glu Ala Leu Gln Gly Gln Val Gln His Leu Gln Ala Ala Phe Ser Gln Tyr (SEQ ID
NO:29).

Another example of a leucine zipper that promotes trimerization is a peptide
comprising the amino acid sequence Arg Met Lys Gln Ile Glu Asp Lys Ile Glu Glu Ile
Leu Ser Lys Ile Tyr His Ile Glu Asn Glu Ile Ala Arg Ile Lys Lys Leu Ile Gly Glu Arg
(SEQ ID NO:30), as described in U.S. Patent 5,716,805. In one alternative
embodiment, an N-terminal Asp residue is added; in another, the peptide lacks the N-
terminal Arg residue.

Fragments of the foregoing zipper peptides that retain the property of
promoting oligomerization may be employed as well. Examples of such fragments

may be derived from naturally occurring leucine zipper peptides, e.g., via conservative

Other peptides derived from naturally occurring trimeric proteins may be
employed in preparing trimeric SVPH. Alternatively, synthetic peptides that promote
oligomerization may be employed. In particular embodiments, leucine residues in a
leucine zipper moiety are replaced by isoleucine residues. Such peptides comprising
isoleucine may be referred to as isoleucine zippers, but are encompassed by the term
"leucine zippers" as employed herein.

PRODUCTION OF POLYPEPTIDES AND FRAGMENTS THEREOF 

Expression, isolation and purification of the polypeptides and fragments of the 
invention may be accomplished by any suitable technique, including but not limited to 
the following: 

Expression Systems 

The present invention also provides recombinant cloning and expression 
vectors containing DNA, as well as host cells containing the recombinant vectors.
Expression vectors comprising DNA may be used to prepare the polypeptides or 
fragments of the invention encoded by the DNA. A method for producing 
polypeptides comprises culturing host cells transformed with a recombinant 
expression vector encoding the polypeptide, under conditions that promote expression 
of the polypeptide, then recovering the expressed polypeptides from the culture. The 
skilled artisan will recognize that the procedure for purifying the expressed 
polypeptides will vary according to such factors as the type of host cells employed, 
and whether the polypeptide is membrane-bound or a soluble form that is secreted 
from the host cell. 

Any suitable expression system may be employed. The vectors include a
DNA encoding a polypeptide or fragment of the invention, operably linked to suitable
transcriptional or translational regulatory nucleotide sequences, such as those derived

nucleotide sequence is operably linked to a DNA sequence if the promoter nucleotide
sequence controls the transcription of the DNA sequence. An origin of replication
that confers the ability to replicate in the desired host cells, and a selection gene by
which transformants are identified, are generally incorporated into the expression
vector.

In addition, a sequence encoding an appropriate signal peptide (native or 
heterologous) can be incorporated into expression vectors. A DNA sequence for a 
signal peptide (secretory leader) may be fused in frame to the nucleic acid sequence of 
the invention so that the DNA is initially transcribed, and the mRNA translated, into a 
fusion protein comprising the signal peptide. A signal peptide that is functional in the 
intended host cells promotes extracellular secretion of the polypeptide. The signal 
peptide is cleaved from the polypeptide upon secretion of polypeptide from the cell. 

The skilled artisan will also recognize that the position(s) at which the signal 
peptide is cleaved may differ from that predicted by computer program, and may vary 
according to such factors as the type of host cells employed in expressing a 
recombinant polypeptide. A protein preparation may include a mixture of protein 
molecules having different N-terminal amino acids, resulting from cleavage of the 
signal peptide at more than one site. 

Suitable host cells for expression of polypeptides include prokaryotes, yeast or 
higher eukaryotic cells. Mammalian or insect cells are generally preferred for use as 
host cells. Appropriate cloning and expression vectors for use with bacterial, fungal, 
yeast, and mammalian cellular hosts are described, for example, in Pouwels et al.,
Cloning Vectors: A Laboratory Manual, Elsevier, New York (1985). Cell-free
translation systems could also be employed to produce polypeptides using RNAs 
derived from DNA constructs disclosed herein. 



Prokaryotic Systems 
expression of the recombinant polypeptide in the prokaryotic host cell. The N-
terminal Met may be cleaved from the expressed recombinant polypeptide.

Expression vectors for use in prokaryotic host cells generally comprise one or
more phenotypic selectable marker genes. A phenotypic selectable marker gene is, for
example, a gene encoding a protein that confers antibiotic resistance or that supplies
an auxotrophic requirement. Examples of useful expression vectors for prokaryotic
host cells include those derived from commercially available plasmids such as the
cloning vector pBR322 (ATCC 37017). pBR322 contains genes for ampicillin and
tetracycline resistance and thus provides simple means for identifying transformed
cells. An appropriate promoter and a DNA sequence are inserted into the pBR322
vector. Other commercially available vectors include, for example, pKK223-3
(Pharmacia Fine Chemicals, Uppsala, Sweden) and pGEM1 (Promega Biotec,
Madison, WI, USA).

Promoter sequences commonly used for recombinant prokaryotic host cell
expression vectors include β-lactamase (penicillinase), lactose promoter system
(Chang et al., Nature 275:615 (1978); and Goeddel et al., Nature 257:544 (1979)),
tryptophan (trp) promoter system (Goeddel et al., Nucl. Acids Res., 5:4057 (1980);
and EP-A-36776) and tac promoter (Maniatis, Molecular Cloning: A Laboratory
Manual, Cold Spring Harbor Laboratory, p. 412 (1982)). A particularly useful
prokaryotic host cell expression system employs a phage λ PL promoter and a cI857ts
thermolabile repressor sequence. Plasmid vectors available from the American Type
Culture Collection which incorporate derivatives of the λ PL promoter include plasmid
pHUB2 (resident in E. coli strain JMB9, ATCC 37092) and pPLc28 (resident in E.
coli RR1, ATCC 53082).

Yeast Systems 

Alternatively, the polypeptides may be expressed in yeast host cells, preferably

for yeast vectors include, among others, promoters for metallothionein, 3-
phosphoglycerate kinase (Hitzeman et al., J. Biol. Chem. 255:2073 (1980)) or other
glycolytic enzymes (Hess et al., J. Adv. Enzyme Reg. 7:149 (1968); and Holland et
al., Biochem. 17:4900 (1978)), such as enolase, glyceraldehyde-3-phosphate
dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-
phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate
isomerase, phospho-glucose isomerase, and glucokinase. Other suitable vectors and
promoters for use in yeast expression are further described in Hitzeman, EPA-73,657.
Another alternative is the glucose-repressible ADH2 promoter described by Russell et
al., J. Biol. Chem., 258:2614 (1982) and Beier et al., Nature, 300:124 (1982). Shuttle
vectors replicable in both yeast and E. coli may be constructed by inserting DNA
sequences from pBR322 for selection and replication in E. coli (Amp^r gene and origin
of replication) into the above-described yeast vectors.

The yeast α-factor leader sequence may be employed to direct secretion of the
polypeptide. The α-factor leader sequence is often inserted between the promoter
sequence and the structural gene sequence. See, e.g., Kurjan et al., Cell, 30:933 (1982) and
Bitter et al., Proc. Natl. Acad. Sci. USA, 81:5330 (1984). Other leader sequences
suitable for facilitating secretion of recombinant polypeptides from yeast hosts are
known to those of skill in the art. A leader sequence may be modified near its 3' end
to contain one or more restriction sites. This will facilitate fusion of the leader
sequence to the structural gene.

Yeast transformation protocols are known to those of skill in the art. One such
protocol is described by Hinnen et al., Proc. Natl. Acad. Sci. USA, 75:1929 (1978).
The Hinnen et al. protocol selects for Trp+ transformants in a selective medium,
wherein the selective medium consists of 0.67% yeast nitrogen base, 0.5% casamino
acids, 2% glucose, 10 mg/ml adenine and 20 mg/ml uracil.

Yeast host cells transformed by vectors containing an ADH2 promoter 
ADH2 promoter occurs when glucose is exhausted from the medium.

Mammalian or Insect Systems

Mammalian or insect host cell culture systems also may be employed to
express recombinant polypeptides. Baculovirus systems for production of
heterologous proteins in insect cells are reviewed by Luckow and Summers,
Bio/Technology 6:47 (1988). Established cell lines of mammalian origin also may be
employed. Examples of suitable mammalian host cell lines include the COS-7 line of
monkey kidney cells (ATCC CRL 1651) (Gluzman et al., Cell 23:175 (1981)), L cells,
C127 cells, 3T3 cells (ATCC CCL 163), Chinese hamster ovary (CHO) cells, HeLa
cells, and BHK (ATCC CRL 10) cell lines, and the CV1/EBNA cell line derived from
the African green monkey kidney cell line CV1 (ATCC CCL 70) as described by
McMahan et al., EMBO J. 10:2821 (1991).

Established methods for introducing DNA into mammalian cells have been
described (Kaufman, R.J., Large Scale Mammalian Cell Culture, pp. 15-69 (1990)).
Additional protocols using commercially available reagents, such as Lipofectamine
lipid reagent (Gibco/BRL) or Lipofectamine-Plus lipid reagent, can be used to
transfect cells (Felgner et al., Proc. Natl. Acad. Sci. USA 84:7413-7417 (1987)). In
addition, electroporation can be used to transfect mammalian cells using conventional
procedures, such as those in Sambrook et al., Molecular Cloning: A Laboratory
Manual, 2nd ed. Vol. 1-3, Cold Spring Harbor Laboratory Press (1989). Selection of
stable transformants can be performed using methods known in the art, such as, for
example, resistance to cytotoxic drugs. Kaufman et al., Meth. in Enzymology
185:487-511 (1990), describes several selection schemes, such as dihydrofolate
reductase (DHFR) resistance. A suitable host strain for DHFR selection can be CHO
strain DX-B11, which is deficient in DHFR (Urlaub and Chasin, Proc. Natl. Acad.
Sci. USA 77:4216-4220 (1980)). A plasmid expressing the DHFR cDNA can be
introduced into strain DX-B11, and only cells that contain the plasmid can grow in the
appropriate selective media. Other examples of selectable markers that can be

Transcriptional and translational control sequences for mammalian host cell

sequences and enhancer sequences are derived from polyoma virus, adenovirus 2,
simian virus 40 (SV40), and human cytomegalovirus. DNA sequences derived from
the SV40 viral genome, for example, SV40 origin, early and late promoter, enhancer,
splice, and polyadenylation sites can be used to provide other genetic elements for
expression of a structural gene sequence in a mammalian host cell. Viral early and
late promoters are particularly useful because both are easily obtained from a viral
genome as a fragment, which can also contain a viral origin of replication (Fiers et al.,
Nature 273:113 (1978); Kaufman et al., Meth. in Enzymology 185:487-511 (1990)).
Smaller or larger SV40 fragments can also be used, provided the approximately 250
bp sequence extending from the Hind III site toward the Bgl I site located in the SV40
viral origin of replication site is included.

Additional control sequences shown to improve expression of heterologous 
genes from mammalian expression vectors include such elements as the expression 
augmenting sequence element (EASE) derived from CHO cells (Morris et al., Animal 
Cell Technology, pp. 529-534 (1997) and PCT Application WO 97/25420) and the 
tripartite leader (TPL) and VA gene RNAs from Adenovirus 2 (Gingeras et al., J. 
Biol. Chem. 257:13475-13491 (1982)). The internal ribosome entry site (IRES) 
sequences of viral origin allow dicistronic mRNAs to be translated efficiently (Oh 
and Sarnow, Current Opinion in Genetics and Development 3:295-300 (1993); 
Ramesh et al., Nucleic Acids Research 24:2697-2700 (1996)). Expression of a 
heterologous cDNA as part of a dicistronic mRNA followed by the gene for a 
selectable marker (e.g., DHFR) has been shown to improve transfectability of the host 
and expression of the heterologous cDNA (Kaufman et al., Meth. in Enzymology 
185:487-511 (1990)). Exemplary expression vectors that employ dicistronic mRNAs 
are pTR-DC/GFP described by Mosser et al., Biotechniques 22:150-161 (1997), and 
p2A5I described by Morris et al., Animal Cell Technology, pp. 529-534 (1997). 

A useful high expression vector, pCAVNOT, has been described by Mosley et 

PMLSV N1/N4, described by Cosman et al., Nature 312:768 (1984), has been 
deposited as ATCC 39890. Additional useful mammalian expression vectors are 
described in EP-A-0367566, and in WO 91/18982, incorporated by reference herein. 
In yet another alternative, the vectors can be derived from retroviruses. 

Additional useful expression vectors, pFLAG® and pDC311, can also be used. 
FLAG® technology is centered on the fusion of a low molecular weight (1 kD), 
hydrophilic, FLAG® marker peptide to the N-terminus of a recombinant protein 
expressed by pFLAG® expression vectors. pDC311 is another specialized vector used 
for expressing proteins in CHO cells. pDC311 is characterized by a bicistronic 
sequence containing the gene of interest and a dihydrofolate reductase (DHFR) gene 
with an internal ribosome binding site for DHFR translation, an expression 
augmenting sequence element (EASE), the human CMV promoter, a tripartite leader 
sequence, and a polyadenylation site. 

Regarding signal peptides that may be employed, the native signal peptide 
may be replaced by a heterologous signal peptide or leader sequence, if desired. The 
choice of signal peptide or leader may depend on factors such as the type of host cells 
in which the recombinant polypeptide is to be produced. To illustrate, examples of 
heterologous signal peptides that are functional in mammalian host cells include the 
signal sequence for interleukin-7 (IL-7) described in United States Patent 4,965,195; 
the signal sequence for interleukin-2 receptor described in Cosman et al., Nature, 
312:768 (1984); the interleukin-4 receptor signal peptide described in EP 367,566; the 
type I interleukin-1 receptor signal peptide described in U.S. Patent 4,968,607; and the 
type II interleukin-1 receptor signal peptide described in EP 460,846. 

Purification 

The invention also includes methods of isolating and purifying the 
polypeptides and fragments thereof. 

Isolation and Purification 



The "isolated" polypeptides or fragments thereof encompassed by this 
invention are polypeptides or fragments that are not in an environment identical to an 
environment in which it or they can be found in nature. The "purified" polypeptides 
or fragments thereof encompassed by this invention are essentially free of association 
with other proteins or polypeptides, for example, as a purification product of 
recombinant expression systems such as those described above or as a purified 
product from a non-recombinant source such as naturally occurring cells and/or 
tissues. 

In one preferred embodiment, the purification of recombinant polypeptides or 
fragments can be accomplished using fusions of polypeptides or fragments of the 
invention to another polypeptide to aid in the purification of polypeptides or 
fragments of the invention. Such fusion partners can include the poly-His or other 
antigenic identification peptides described above as well as the Fc moieties described 
previously. 

With respect to any type of host cell, as is known to the skilled artisan, 
procedures for purifying a recombinant polypeptide or fragment will vary according to 
such factors as the type of host cells employed and whether or not the recombinant 
polypeptide or fragment is secreted into the culture medium. 

In general, the recombinant polypeptide or fragment can be isolated from the 
host cells if not secreted, or from the medium or supernatant if soluble and secreted, 
followed by one or more concentration, salting-out, ion exchange, hydrophobic 
interaction, affinity purification or size exclusion chromatography steps. As to 
specific ways to accomplish these steps, the culture medium first can be concentrated 
using a commercially available protein concentration filter, for example, an Amicon 
or Millipore Pellicon ultrafiltration unit. Following the concentration step, the 
concentrate can be applied to a purification matrix such as a gel filtration medium. 



purification. Alternatively, a cation exchange step can be employed. Suitable cation 

carboxymethyl groups. In addition, a chromatofocusing step can be employed. 
Alternatively, a hydrophobic interaction chromatography step can be employed. 
Suitable matrices can be phenyl or octyl moieties bound to resins. In addition, affinity 
chromatography with a matrix which selectively binds the recombinant protein can be 
employed. Examples of such resins employed are lectin columns, dye columns, and 
metal-chelating columns. Finally, one or more reversed-phase high performance 
liquid chromatography (RP-HPLC) steps employing hydrophobic RP-HPLC media 
(e.g., silica gel or polymer resin having pendant methyl, octyl, octyldecyl or other 
aliphatic groups) can be employed to further purify the polypeptides. Some or all of 
the foregoing purification steps, in various combinations, are well known and can be 
employed to provide an isolated and purified recombinant protein. 

It is also possible to utilize an affinity column comprising a polypeptide- 
binding protein of the invention, such as a monoclonal antibody generated against 
polypeptides of the invention, to affinity-purify expressed polypeptides. These 
polypeptides can be removed from an affinity column using conventional techniques, 
e.g., in a high salt elution buffer and then dialyzed into a lower salt buffer for use or 
by changing pH or other components depending on the affinity matrix utilized, or be 
competitively removed using the naturally occurring substrate of the affinity moiety, 
such as a polypeptide derived from the invention. 

In this aspect of the invention, polypeptide-binding proteins, such as the 
anti-polypeptide antibodies of the invention or other proteins that may interact with the 
polypeptide of the invention, can be bound to a solid phase support such as a column 
chromatography matrix or a similar substrate suitable for identifying, separating, or 
purifying cells that express polypeptides of the invention on their surface. Adherence 
of polypeptide-binding proteins of the invention to a solid phase contacting surface 
can be accomplished by any means; for example, magnetic microspheres can be 
coated with these polypeptide-binding proteins and held in the incubation vessel 

Methods of releasing positively selected cells from the solid phase are known in the 
art and encompass, for example, the use of enzymes. Such enzymes are preferably 
non-toxic and non-injurious to the cells and are preferably directed to cleaving the 
cell-surface binding partner. 

Alternatively, mixtures of cells suspected of containing 
polypeptide-expressing cells of the invention first can be incubated with a biotinylated 
polypeptide-binding protein of the invention. Incubation periods are typically at least 
one hour in duration to ensure sufficient binding to polypeptides of the invention. The 
resulting mixture then is passed through a column packed with avidin-coated beads, 
whereby the high affinity of biotin for avidin provides the binding of the 
polypeptide-binding cells to the beads. Use of avidin-coated beads is known in the art. See 
Berenson, et al., J. Cell. Biochem., 10D:239 (1986). Washing away of unbound material and 
the release of the bound cells are performed using conventional methods. 

The desired degree of purity depends on the intended use of the protein. A 
relatively high degree of purity is desired when the polypeptide is to be administered 
in vivo, for example. In such a case, the polypeptides are purified such that no protein 
bands corresponding to other proteins are detectable upon analysis by 
SDS-polyacrylamide gel electrophoresis (SDS-PAGE). It will be recognized by one skilled 
in the pertinent field that multiple bands corresponding to the polypeptide may be 
visualized by SDS-PAGE, due to differential glycosylation, differential 
post-translational processing, and the like. Most preferably, the polypeptide of the 
invention is purified to substantial homogeneity, as indicated by a single protein band 
upon analysis by SDS-PAGE. The protein band may be visualized by silver staining, 
Coomassie blue staining, or (if the protein is radiolabeled) by autoradiography. 



Assays 



The purified polypeptides of the invention (including proteins, polypeptides, 

binding partner. The cells then are washed to remove unbound labeled polypeptide, 
and the presence of cell-bound label is determined by a suitable technique, chosen 
according to the nature of the label. 

One example of a binding assay procedure is as follows. A recombinant 
expression vector containing binding partner cDNA is constructed using methods well 
known in the art. The binding partner comprises an N-terminal cytoplasmic domain, a 
transmembrane region, and a C-terminal extracellular domain. CV1-EBNA-1 cells in 
10 cm² dishes are transfected with the recombinant expression vector. CV-1/EBNA-1 
cells (ATCC CRL 10478) constitutively express EBV nuclear antigen-1 driven from 
the CMV immediate-early enhancer/promoter. CV1-EBNA-1 was derived from the 
African Green Monkey kidney cell line CV-1 (ATCC CCL 70), as described by 
McMahan et al., EMBO J. 10:2821 (1991). 

The transfected cells are cultured for 24 hours, and the cells in each dish then 
are split into a 24-well plate. After culturing an additional 48 hours, the transfected 
cells (about 4 x 10^4 cells/well) are washed with BM-NFDM, which is binding medium 
(RPMI 1640 containing 25 mg/ml bovine serum albumin, 2 mg/ml sodium azide, 20 
mM Hepes pH 7.2) to which 50 mg/ml nonfat dry milk has been added. The cells 
then are incubated for 1 hour at 37°C with various concentrations of, for example, a 
soluble polypeptide/Fc fusion protein made as set forth above. Cells then are washed 
and incubated with a constant saturating concentration of a 125I-mouse anti-human IgG 
in binding medium, with gentle agitation for 1 hour at 37°C. After extensive washing, 
cells are released via trypsinization. 

The mouse anti-human IgG employed above is directed against the Fc region 
of human IgG and can be obtained from Jackson Immunoresearch Laboratories, Inc., 
West Grove, PA. The antibody is radioiodinated using the standard chloramine-T 
method. The antibody will bind to the Fc portion of any polypeptide/Fc protein that 
has bound to the cells. In all assays, non-specific binding of 125I-antibody is assayed 

Affinity calculations (Scatchard, Ann. N.Y. Acad. Sci. 51:660 (1949)) are generated 








Another type of suitable binding assay is a competitive binding assay. To 
illustrate, biological activity of a variant may be determined by assaying for the 
variant's ability to compete with the native protein for binding to the binding partner. 

Competitive binding assays can be performed by conventional methodology. 
Reagents that may be employed in competitive binding assays include radiolabeled 
SVPH and intact cells expressing the binding partner (endogenous or recombinant) on 
the cell surface. For example, a radiolabeled soluble SVPH fragment can be used to 
compete with a soluble SVPH variant for binding to cells expressing the binding 
partner on the surface. Instead of intact cells, one could substitute a soluble binding 
partner/Fc fusion protein bound to a solid phase through the interaction of Protein A 
or Protein G (on the solid phase) with the Fc moiety. Chromatography columns that 
contain Protein A and Protein G include those available from Pharmacia Biotech, Inc., 
Piscataway, NJ. 

Another type of competitive binding assay utilizes radiolabeled soluble 
binding partner, such as a soluble binding partner/Fc fusion protein, and intact cells 
expressing SVPH. Qualitative results can be obtained by competitive 
autoradiographic plate binding assays, while Scatchard plots (Scatchard, Ann. N.Y. 
Acad. Sci. 51:660 (1949)) may be utilized to generate quantitative results. 
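
To illustrate how a Scatchard plot yields quantitative results, the following minimal sketch fits the relation B/F = (Bmax - B)/Kd by ordinary least squares. It is an illustration only and not the software used in the procedure described herein; the function name, the units, and the sample values are hypothetical, and the input is assumed to be specific binding (non-specific binding already subtracted).

```python
def scatchard_fit(bound, free):
    """Fit bound/free against bound with a straight line.

    Scatchard relation: B/F = (Bmax - B) / Kd, so the slope is -1/Kd
    and the x-intercept is Bmax. Returns (kd, bmax).
    """
    if len(bound) != len(free) or len(bound) < 2:
        raise ValueError("need at least two paired bound/free values")
    x = list(bound)
    y = [b / f for b, f in zip(bound, free)]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope            # dissociation constant, same units as `free`
    bmax = -intercept / slope    # x-intercept: total binding sites
    return kd, bmax


# Hypothetical data generated from a single-site model (Kd = 2.0, Bmax = 100.0).
example_free = [0.5, 1.0, 2.0, 4.0, 8.0]
example_bound = [100.0 * f / (2.0 + f) for f in example_free]
example_kd, example_bmax = scatchard_fit(example_bound, example_free)
```

A curved rather than straight Scatchard plot would suggest more than one class of binding site, in which case a single-line fit of this kind is not appropriate.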

USE OF SVPH NUCLEIC ACID OR OLIGONUCLEOTIDES 

In addition to being used to express polypeptides as described above, the 
nucleic acids of the invention, including DNA, RNA, mRNA, and oligonucleotides 
thereof can be used: 



as probes to identify nucleic acid encoding proteins having proteinase 
activity; 

to identify human chromosome number 1 or 4; 

to map genes on human chromosome number 1 or 4; 

as single-stranded sense or antisense oligonucleotides, to inhibit 
expression of polypeptides encoded by the SVPH-1, SVPH-3, or 
SVPH-4 gene; 

to detect defective genes in an individual; and 

for gene therapy. 



The nucleotides of the invention can be used as probes to identify nucleic acid 
encoding proteins having similar activity or structure. Such uses include the use of 
fragments. Such fragments may comprise any length of contiguous nucleotides. In 
one embodiment, the fragment comprises at least about 17 contiguous nucleotides of a 
DNA sequence. In other embodiments, a DNA fragment comprises at least 30, or at 
least 60, contiguous nucleotides of a DNA sequence. 

Because homologs of SEQ ID NOs:1-3 and 7-11 from other mammalian 
species are contemplated herein, probes based on the human DNA sequence of SEQ 
ID NOs:1-3 and 7-11 may be used to screen cDNA libraries derived from other 
mammalian species, using conventional cross-species hybridization techniques. 

Using knowledge of the genetic code in combination with the amino acid 
sequences set forth above, sets of degenerate oligonucleotides can be prepared. Such 
oligonucleotides are useful as primers, e.g., in polymerase chain reactions (PCR), 
whereby DNA fragments are isolated and amplified. 
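
By way of illustration, a set of degenerate oligonucleotides can be enumerated from a short peptide by back-translation with the standard genetic code. The sketch below is a minimal example under stated assumptions: the peptide used is hypothetical and is not taken from any SEQ ID NO, and the function names are illustrative only.

```python
from itertools import product
from math import prod

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")

CODON_TABLE = {}
for index, codon in enumerate(product(BASES, repeat=3)):
    CODON_TABLE.setdefault(AMINO_ACIDS[index], []).append("".join(codon))
CODON_TABLE.pop("*")  # stop codons are not back-translated


def codons_for(residue):
    """Return every codon that encodes a one-letter amino acid."""
    try:
        return CODON_TABLE[residue.upper()]
    except KeyError:
        raise ValueError(f"unknown amino acid: {residue!r}") from None


def degeneracy(peptide):
    """Number of distinct DNA sequences that encode the peptide."""
    return prod(len(codons_for(residue)) for residue in peptide)


def degenerate_oligos(peptide):
    """Enumerate every DNA sequence that encodes the peptide."""
    choices = [codons_for(residue) for residue in peptide]
    return ["".join(codons) for codons in product(*choices)]


# Hypothetical six-residue stretch chosen for low degeneracy (18 nucleotides).
EXAMPLE_PEPTIDE = "MWHKDE"
example_pool = degenerate_oligos(EXAMPLE_PEPTIDE)
```

Stretches rich in methionine and tryptophan, which have a single codon each, keep the pool small; stretches rich in leucine, serine, or arginine, which have six codons each, enlarge it quickly.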

Chromosome Mapping 

All or a portion of the nucleic acids of SEQ ID NOs:1-3 and 7-11, including 
oligonucleotides, can be used by those skilled in the art using well-known techniques 
to identify the human chromosome, and the specific locus thereof, that contains the DNA 
of SVPH family members. For example, all or a portion of SEQ ID NO:3, SEQ ID 

are not limited to, using the sequence or portions, including oligonucleotides, as a 

resolution), in situ hybridization to chromosome spreads (moderate resolution), and 
Southern blot hybridization to hybrid cell lines containing individual human 
chromosomes (low resolution). 

For example, chromosomes can be mapped by radiation hybrid mapping. 
First, PCR is performed using the Whitehead Institute/MIT Center for Genome 
Research Genebridge4 panel of 93 radiation hybrids 
(http://www-genome.wi.mit.edu/ftp/distribution/human_STS_releases/july97/rhmap/genebridge4.html). 
Primers are used which lie 
within a putative exon of the gene of interest and which amplify a product from 
human genomic DNA, but do not amplify hamster genomic DNA. The results of the 
PCRs are converted into a data vector that is submitted to the Whitehead/MIT 
Radiation Mapping site on the internet (http://www-seq.wi.mit.edu). The data are 
scored and the chromosomal assignment and placement relative to known Sequence 
Tag Site (STS) markers on the radiation hybrid map are provided. The following web 
site provides additional information about radiation hybrid mapping: 
http://www-genome.wi.mit.edu/ftp/distribution/human_STS_releases/july97/07-97.INTRO.html. 
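
The conversion of PCR results into a data vector can be sketched as follows. This is a minimal illustration and not a description of the mapping server's actual input specification: the coding used here (1 for a positive hybrid, 0 for a negative hybrid, 2 for an ambiguous result, one character per hybrid in panel order) is an assumption, and the function name and sample hybrid numbers are hypothetical.

```python
PANEL_SIZE = 93  # number of radiation hybrids in the panel, as stated above


def rh_data_vector(positive, ambiguous=(), panel_size=PANEL_SIZE):
    """Encode PCR results as one character per hybrid, in panel order.

    positive  -- 1-based numbers of hybrids that gave the human product
    ambiguous -- 1-based numbers of hybrids with unclear results
    Assumed coding: '1' positive, '0' negative, '2' ambiguous.
    """
    positive = set(positive)
    ambiguous = set(ambiguous)
    if positive & ambiguous:
        raise ValueError("a hybrid cannot be both positive and ambiguous")
    for number in positive | ambiguous:
        if not 1 <= number <= panel_size:
            raise ValueError(f"hybrid number out of range: {number}")
    codes = []
    for number in range(1, panel_size + 1):
        if number in ambiguous:
            codes.append("2")
        elif number in positive:
            codes.append("1")
        else:
            codes.append("0")
    return "".join(codes)


# Hypothetical scoring of one primer pair across the panel.
example_vector = rh_data_vector(positive=[1, 3, 40, 93], ambiguous=[2])
```

Keeping ambiguous results as a separate code, rather than forcing them to positive or negative, lets the scoring step treat them as missing data.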

Identifying Associated Diseases 

As set forth below, sequences encoding SVPH-4a and SVPH-4b have been 
mapped by radiation hybrid mapping to the 1p11-13 region of chromosome 1. That 
region is associated with specific diseases which include but are not limited to fetal 
hydantoin syndrome, diphenylhydantoin toxicity, and pheochromocytoma. Thus, the 
nucleic acid of SEQ ID NOs:3, 10 and 11, or a fragment thereof, can be used by one 
skilled in the art using well-known techniques to analyze abnormalities associated 
with SVPH-4 genes. In addition, sequences encoding SVPH-1a, SVPH-1b, and 
SVPH-1c have been mapped by radiation hybrid mapping to the 4q34 region of 



nucleic acid of SEQ ID NO:2, or a fragment thereof, to analyze abnormalities 

this marker is rearranged or deleted. In addition, nucleic acid of SEQ ID NOs:1-3 
and 7-11 or a fragment thereof can be used as a positional marker to map other genes 
of unknown location. 

The DNA may be used in developing treatments for any disorder mediated 
(directly or indirectly) by defective, or insufficient amounts of the genes 
corresponding to the nucleic acids of the invention. Disclosure herein of native 
nucleotide sequences permits the detection of defective genes, and the replacement 
thereof with normal genes. Defective genes may be detected in in vitro diagnostic 
assays, and by comparison of a native nucleotide sequence disclosed herein with that 
of a gene derived from a person suspected of harboring a defect in this gene. 

Sense-Antisense 

Other useful fragments of the nucleic acids include antisense or sense 
oligonucleotides comprising a single-stranded nucleic acid sequence (either RNA or 
DNA) capable of binding to target mRNA (sense) or DNA (antisense) sequences. 
Antisense or sense oligonucleotides, according to the present invention, comprise a 
fragment of SEQ ID NOs:1-3 or 7-11. Such a fragment generally comprises at least 
about 14 nucleotides, preferably from about 14 to about 30 nucleotides. The ability to 
derive an antisense or a sense oligonucleotide, based upon a cDNA sequence encoding 
a given protein is described in, for example, Stein and Cohen, Cancer Res. 48:2650 
(1988) and van der Krol et al., BioTechniques 6:958 (1988). 
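
As a simple illustration of deriving antisense oligonucleotides from a cDNA sequence, the sketch below slides a window of fixed length along the sense strand and reports the reverse complement of each window. The sequence shown is hypothetical and is not taken from any SEQ ID NO, the function names are illustrative, and no attempt is made to rank candidates by target accessibility or specificity.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    dna = dna.upper()
    if set(dna) - set("ACGT"):
        raise ValueError("sequence must contain only A, C, G and T")
    return dna.translate(_COMPLEMENT)[::-1]


def antisense_candidates(cdna, length=20):
    """List (start, oligo) pairs for every window of the given length.

    `cdna` is the sense strand; each oligo is the antisense DNA that would
    pair with the corresponding stretch of the mRNA. The length limits
    follow the range of about 14 to about 30 nucleotides given above.
    """
    if not 14 <= length <= 30:
        raise ValueError("length should be between 14 and 30 nucleotides")
    cdna = cdna.upper()
    return [
        (start, reverse_complement(cdna[start:start + length]))
        for start in range(len(cdna) - length + 1)
    ]


# Hypothetical sense-strand fragment used only to exercise the functions.
EXAMPLE_CDNA = "ATGGCCAAGCTGACCTGA"
example_oligos = antisense_candidates(EXAMPLE_CDNA, length=14)
```

A sense oligonucleotide for a DNA target would simply be the window itself rather than its reverse complement.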

Binding of antisense or sense oligonucleotides to target nucleic acid sequences 
results in the formation of duplexes that block or inhibit protein expression by one of 
several means, including enhanced degradation of the mRNA by RNAse H, inhibition 
of splicing, premature termination of transcription or translation, or by other means. 
The antisense oligonucleotides thus may be used to block expression of proteins. 
Antisense or sense oligonucleotides further comprise oligonucleotides having 



in vivo (i.e., capable of resisting enzymatic degradation) but retain sequence 

Other examples of sense or antisense oligonucleotides include those 
oligonucleotides which are covalently linked to organic moieties, such as those 
described in WO 90/10448, and other moieties that increase affinity of the 
oligonucleotide for a target nucleic acid sequence, such as poly-(L-lysine). Further 
still, intercalating agents, such as ellipticine, and alkylating agents or metal complexes 
may be attached to sense or antisense oligonucleotides to modify binding specificities 
of the antisense or sense oligonucleotide for the target nucleotide sequence. 

Antisense or sense oligonucleotides may be introduced into a cell containing 
the target nucleic acid sequence by any gene transfer method, including, for example, 
lipofection, CaPO4-mediated DNA transfection, electroporation, or by using gene 
transfer vectors such as Epstein-Barr virus. 

Sense or antisense oligonucleotides also may be introduced into a cell 
containing the target nucleotide sequence by formation of a conjugate with a ligand 
binding molecule, as described in WO 91/04753. Suitable ligand binding molecules 
include, but are not limited to, cell surface receptors, growth factors, other cytokines, 
or other ligands that bind to cell surface receptors. Preferably, conjugation of the 
ligand binding molecule does not substantially interfere with the ability of the ligand 
binding molecule to bind to its corresponding molecule or receptor, or block entry of 
the sense or antisense oligonucleotide or its conjugated version into the cell. 

Alternatively, a sense or an antisense oligonucleotide may be introduced into a 
cell containing the target nucleic acid sequence by formation of an oligonucleotide- 
lipid complex, as described in WO 90/10448. The sense or antisense oligonucleotide- 
lipid complex is preferably dissociated within the cell by an endogenous lipase. 

USE OF SVPH POLYPEPTIDES AND FRAGMENTED POLYPEPTIDES 
Uses include, but are not limited to, the following: 



Purifying proteins and measuring activity thereof 

Delivery Agents 

Therapeutic and Research Agents 

Molecular Weight and Isoelectric Point Markers 

Controls for peptide fragmentation 

Identification of unknown proteins 

Preparation of Antibodies 

Purification Reagents 

Each of the polypeptides of the invention finds use as a protein purification 
reagent. For example, the polypeptides may be used to purify binding partner 
proteins. In particular embodiments, a polypeptide (in any form described herein that 
is capable of binding the binding partner) is attached to a solid support by 
conventional procedures. As one example, affinity chromatography columns 
containing functional groups that will react with functional groups on amino acid side 
chains of proteins are available (Pharmacia Biotech, Inc., Piscataway, NJ). In an 
alternative, a polypeptide/Fc protein (as discussed above) is attached to Protein A- or 
Protein G-containing chromatography columns through interaction with the Fc 
moiety. 

The polypeptide also finds use in purifying or identifying cells that express the 
binding partner on the cell surface. Polypeptides are bound to a solid phase such as a 
column chromatography matrix or a similar suitable substrate. For example, magnetic 
microspheres can be coated with the polypeptides and held in an incubation vessel 
through a magnetic field. Suspensions of cell mixtures containing the binding partner 
expressing cells are contacted with the solid phase having the polypeptides thereon. 
Cells expressing the binding partner on the cell surface bind to the fixed polypeptides, 
and unbound cells then are washed away. 

Alternatively, the polypeptides can be conjugated to a detectable moiety, then 
incubated with cells to be tested for binding partner expression. After incubation, 
unbound labeled matter is removed and the presence or absence of the detectable 
moiety on the cells is determined. 

In a further alternative, mixtures of cells suspected of containing cells 



coated beads, whereby the high affinity of biotin for avidin provides binding of the 

Berenson, et al., J. Cell. Biochem., 10D:239 (1986). Washing to remove unbound 
material, and the release of the bound cells, are performed using conventional 
methods. 

Measuring Activity 

Polypeptides also find use in measuring the biological activity of the binding 
partner protein in terms of their binding affinity. The polypeptides thus may be 
employed by those conducting "quality assurance" studies, e.g., to monitor shelf life 
and stability of protein under different conditions. For example, the polypeptides may 
be employed in a binding affinity study to measure the biological activity of a binding 
partner protein that has been stored at different temperatures, or produced in different 
cell types. The proteins also may be used to determine whether biological activity is 
retained after modification of a binding partner protein (e.g., chemical modification, 
truncation, mutation, etc.). The binding affinity of the modified binding partner 
protein is compared to that of an unmodified binding partner protein to detect any 
adverse impact of the modifications on biological activity of the binding partner. The 
biological activity of a binding partner protein thus can be ascertained before it is used 
in a research study, for example. 

Delivery Agents 

The polypeptides also find use as carriers for delivering agents attached thereto 
to cells bearing the binding partner (or to other cell types found to express the binding 
partner on the cell surface) in in vitro or in vivo procedures. 

Detectable (diagnostic) and therapeutic agents that may be attached to a 
polypeptide include, but are not limited to, toxins, other cytotoxic agents, drugs, 
radionuclides, chromophores, enzymes that catalyze a colorimetric or fluorometric 
reaction, and the like, with the particular agent being chosen according to the intended 

Radionuclides suitable for diagnostic use include, but are not limited to, 123I, 131I, 
99mTc, 111In, and 76Br. Examples of radionuclides suitable for therapeutic use are 131I, 
211At, 77Br, 186Re, 188Re, 212Pb, 212Bi, 109Pd, 64Cu, and 67Cu. 

Such agents may be attached to the polypeptide by any suitable conventional 
procedure. The polypeptide comprises functional groups on amino acid side chains 
that can be reacted with functional groups on a desired agent to form covalent bonds, 
for example. Alternatively, the protein or agent may be derivatized to generate or 
attach a desired reactive functional group. The derivatization may involve attachment 
of one of the bifunctional coupling reagents available for attaching various molecules 
to proteins (Pierce Chemical Company, Rockford, Illinois). A number of techniques 
for radiolabeling proteins are known. Radionuclide metals may be attached to 
polypeptides by using a suitable bifunctional chelating agent, for example. 

Conjugates comprising polypeptides and a suitable diagnostic or therapeutic 
agent (preferably covalently linked) are thus prepared. The conjugates are 
administered or otherwise employed in an amount appropriate for the particular 
application. 

Therapeutic Agents 

Polypeptides of the invention may be used in developing treatments for any 
disorder mediated (directly or indirectly) by defective, or insufficient amounts of the 
polypeptides. These polypeptides may be administered to a mammal afflicted with 
such a disorder. 

The polypeptides may also be employed in inhibiting a biological activity of 
the binding partner, in in vitro or in vivo procedures. For example, a purified 
polypeptide may be used to inhibit binding of the binding partner to an endogenous 
cell surface binding partner. Biological effects that result from the binding of SVPH 
to endogenous binding partner thus are inhibited. 



In addition, an SVPH binding partner may be administered to a mammal to 

Compositions of the present invention may contain a polypeptide in any form 

biologically active fragments. In particular embodiments, the composition comprises 
a soluble polypeptide or an oligomer comprising soluble SVPH polypeptides or SVPH 
binding partner polypeptides. 

Compositions comprising an effective amount of a polypeptide of the present 
invention, in combination with other components such as a physiologically acceptable 
diluent, carrier, or excipient, are provided herein. The polypeptides can be formulated 
according to known methods used to prepare pharmaceutically useful compositions. 
They can be combined in admixture, either as the sole active material or with other 
known active materials suitable for a given indication, with pharmaceutically 
acceptable diluents (e.g., saline, Tris-HCl, acetate, and phosphate buffered solutions), 
preservatives (e.g., thimerosal, benzyl alcohol, parabens), emulsifiers, solubilizers, 
adjuvants and/or carriers. Suitable formulations for pharmaceutical compositions 
include those described in Remington's Pharmaceutical Sciences, 16th ed. 1980, Mack 
Publishing Company, Easton, PA. 

In addition, such compositions can be complexed with polyethylene glycol 
(PEG), metal ions, or incorporated into polymeric compounds such as polyacetic acid, 
polyglycolic acid, hydrogels, dextran, etc., or incorporated into liposomes, 
microemulsions, micelles, unilamellar or multilamellar vesicles, erythrocyte ghosts or 
spheroblasts. Such compositions will influence the physical state, solubility, stability, 
rate of in vivo release, and rate of in vivo clearance, and are thus chosen according to 
the intended application. 

The compositions of the invention can be administered in any suitable manner, 
e.g., topically, parenterally, or by inhalation. The term "parenteral" includes injection, 
e.g., by subcutaneous, intravenous, or intramuscular routes, also including localized 
administration, e.g., at a site of disease or injury. Sustained release from implants is 
also contemplated. One skilled in the pertinent art will recognize that suitable dosages 
will vary, depending upon such factors as the nature of the disorder to be treated, the 

Compositions comprising nucleic acids in physiologically acceptable 
formulations are also contemplated. DNA may be formulated for injection, for 
example. 



Research Agents 

Another use of the polypeptide of the present invention is as a research tool for 
studying the biological effects that result from inhibiting binding partner/SVPH 
interactions on different cell types. Polypeptides also may be employed in in vitro 
assays for detecting the binding partner or SVPH or the interactions thereof. 

Molecular Weight, Isoelectric Point Markers 

The polypeptides of the present invention can be subjected to fragmentation 
into smaller peptides by chemical and enzymatic means, and the peptide fragments so 
produced can be used in the analysis of other proteins or polypeptides. For example, 
such peptide fragments can be used as peptide molecular weight markers, peptide 
isoelectric point markers, or in the analysis of the degree of peptide fragmentation. 
Thus, the invention also includes these polypeptides and peptide fragments, as well as 
kits to aid in the determination of the apparent molecular weight and isoelectric point 
of an unknown protein and kits to assess the degree of fragmentation of an unknown 
protein. 

Although all methods of fragmentation are encompassed by the invention, 
chemical fragmentation is a preferred embodiment, and includes the use of cyanogen 
bromide to cleave under neutral or acidic conditions such that specific cleavage occurs 
at methionine residues (E. Gross, Methods in Enz., 11:238-255 (1967)). This can 
further include additional steps, such as a carboxymethylation step to convert cysteine 
residues to an unreactive species. Table 1 summarizes the fragmentation pattern of 
SEQ ID NOs:12-16 following chemical cleavage with cyanogen bromide. 
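
The cleavage rule described above can be sketched in a few lines of Python. This is a minimal illustration only: it assumes a one-letter amino acid string, cuts on the carboxyl side of every methionine, and ignores incomplete cleavage and the chemical modification of the methionine residue. The function name and the toy sequence are hypothetical and are not taken from the sequence listing.

```python
def cnbr_fragments(sequence: str) -> list[str]:
    """Split a one-letter protein sequence after every methionine (M)."""
    fragments = []
    start = 0
    for i, residue in enumerate(sequence):
        if residue == "M":
            # Cleavage occurs on the carboxyl side of the methionine.
            fragments.append(sequence[start:i + 1])
            start = i + 1
    if start < len(sequence):
        # Keep the C-terminal remainder that follows the last methionine.
        fragments.append(sequence[start:])
    return fragments


# Hypothetical toy sequence, not one of the SEQ ID NOs.
print(cnbr_fragments("MKTAYMAKQRMG"))
```

Applying the same function to a full-length sequence would give the set of fragments whose sizes a table such as Table 1 reports.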



Endoproteinase Asp-N, or Endoproteinase Lys-C under conventional conditions to 

cleave specifically on the carboxyl side of the asparagine residues present within the 
polypeptides of the invention. Arginylendo-peptidase can cleave specifically on the 
carboxyl side of the arginine residues present within these polypeptides. 
Achromobacter protease I can cleave specifically on the carboxyl side of the lysine 
residues present within the polypeptides (Sakiyama and Nakata, U.S. Patent 
No. 5,248,599; T. Masaki et al., Biochim. Biophys. Acta, 660:44-50 (1981); T. Masaki 
et al., Biochim. Biophys. Acta, 660:51-55 (1981)). Trypsin can cleave specifically on 
the carboxyl side of the arginine and lysine residues present within polypeptides of the 
invention. Enzymatic fragmentation may also occur with a protease that cleaves at 
multiple amino acid residues. For example, Staphylococcus aureus V8 protease can 
cleave specifically on the carboxyl side of the aspartic and glutamic acid residues 
present within polypeptides (D. W. Cleveland, J. Biol. Chem., 252:1102-1106 (1977)). 
Endoproteinase Asp-N can cleave specifically on the amino side of the asparagine 
residues present within polypeptides. Endoproteinase Lys-C can cleave specifically 
on the carboxyl side of the lysine residues present within polypeptides of the 
invention. Other enzymatic and chemical treatments can likewise be used to 
specifically fragment these polypeptides into a unique set of specific peptides. 
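
The carboxyl-side cleavage rules stated above can be collected into a small lookup table and applied in silico. The sketch below is illustrative only: the enzyme keys and the function name are hypothetical, only enzymes described above as cleaving on the carboxyl side are included, and known exceptions (for example, reduced cleavage next to proline) are deliberately ignored.

```python
# Residues after which each enzyme cleaves (carboxyl side), per the text above.
CLEAVE_AFTER = {
    "trypsin": "KR",   # arginine and lysine
    "lys_c": "K",      # Endoproteinase Lys-C / Achromobacter protease I
    "arg_c": "R",      # arginylendopeptidase
    "v8": "DE",        # S. aureus V8 protease: aspartic and glutamic acid
}


def digest(sequence: str, enzyme: str) -> list[str]:
    """Return the peptides produced by complete digestion with one enzyme."""
    sites = CLEAVE_AFTER[enzyme]
    fragments = []
    start = 0
    for i, residue in enumerate(sequence):
        if residue in sites:
            fragments.append(sequence[start:i + 1])
            start = i + 1
    if start < len(sequence):
        fragments.append(sequence[start:])
    return fragments


# Hypothetical toy sequence used only to show the different fingerprints.
for name in CLEAVE_AFTER:
    print(name, digest("AKDERG", name))
```

Because each enzyme yields a different fragment set from the same sequence, comparing the outputs shows how the choice of cleavage agent changes the peptide fingerprint.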

Of course, the peptides and fragments of the polypeptides of the invention can 
also be produced by conventional recombinant processes and synthetic processes well 
known in the art. With regard to recombinant processes, the polypeptides and peptide 
fragments encompassed by the invention can have variable molecular weights, depending 
upon the host cell in which they are expressed. Glycosylation of polypeptides and 
peptide fragments of the invention in various cell types can result in variations of the 
molecular weight of these pieces, depending upon the extent of modification. The 
size of these pieces can be most heterogeneous with fragments of polypeptide derived 
from the extracellular portion of the polypeptide. Consistent polypeptides and peptide 
fragments can be obtained by using polypeptides derived entirely from the 

and carboxyl terminal ends of polypeptides of the invention can be used to enhance 
expression of these polypeptides or aid in the purification of the protein. In addition, 
fusions of additional peptide sequences at the amino and carboxyl terminal ends of 
polypeptides of the invention will alter some, but usually not all, of the fragmented 
peptides of the polypeptides generated by enzymatic or chemical treatment. Of 
course, mutations can be introduced into polypeptides of the invention using routine 
and known techniques of molecular biology. For example, a mutation can be 
designed so as to eliminate a site of proteolytic cleavage by a specific enzyme or a site 
of cleavage by a specific chemically induced fragmentation procedure. The 
elimination of the site will alter the peptide fingerprint of polypeptides of the 
invention upon fragmentation with the specific enzyme or chemical procedure. 

When the invention relates to the use of fragmented peptide molecular weight 
markers, those markers are preferably at least 10 amino acids in size. More 
preferably, these fragmented peptide molecular weight markers are between 10 and 
100 amino acids in size. Even more preferable are fragmented peptide molecular 
weight markers between 10 and 50 amino acids in size and especially between 10 and 
35 amino acids in size. Most preferable are fragmented peptide molecular weight 
markers between 10 and 20 amino acids in size. 

Because the unique amino acid sequence of each fragment specifies a 
molecular weight, these fragments can thereafter serve as molecular weight markers 
using such analysis techniques to assist in the determination of the molecular weight 
of an unknown protein, polypeptides or fragments thereof. The molecular weight 
markers of the invention serve particularly well as molecular weight markers for the 
estimation of the apparent molecular weight of proteins that have similar apparent 
molecular weights and, consequently, allow increased accuracy in the determination 
of apparent molecular weight of proteins. 

Among the methods for determining molecular weight are sedimentation, gel 

conditions allows for increased accuracy. It is understood, of course, that many 
different techniques can be used for the determination of the molecular weight of an 
unknown protein using polypeptides of the invention, and that this embodiment in no 
way limits the scope of the invention. 
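
As one illustration of how markers of known molecular weight can be used, the sketch below fits a straight line to log10(molecular weight) versus relative gel mobility and reads an unknown off that line. The log-linear relationship is a common working approximation for gel electrophoresis and is an assumption here, not a statement from this specification; the function names and the mobility values are hypothetical.

```python
import math


def fit_log_mw(markers):
    """Least-squares fit of log10(MW) against relative mobility.

    markers: list of (relative_mobility, molecular_weight) pairs.
    Returns (slope, intercept) of the fitted line.
    """
    xs = [mobility for mobility, _ in markers]
    ys = [math.log10(weight) for _, weight in markers]
    n = len(markers)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    return slope, mean_y - slope * mean_x


def estimate_mw(mobility, slope, intercept):
    """Apparent molecular weight of a band at the given relative mobility."""
    return 10 ** (slope * mobility + intercept)


# Hypothetical marker mobilities, chosen only to illustrate the calculation.
slope, intercept = fit_log_mw([(0.2, 100000), (0.4, 10000), (0.6, 1000)])
print(round(estimate_mw(0.5, slope, intercept)))
```

The estimate is most reliable when the unknown migrates between markers of similar size, which is consistent with the preference stated above for markers close in apparent molecular weight to the unknown protein.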

In addition, each unglycosylated polypeptide or fragment thereof has a pI that 
is intrinsically determined by its unique amino acid sequence (which pI can be 
estimated by the skilled artisan using any of the computer programs designed to 
predict pI values currently available, calculated using any well-known amino acid pKa 
table, or measured empirically). Therefore these polypeptides and fragments thereof 
can serve as specific markers to assist in the determination of the isoelectric point of 
an unknown protein, polypeptide, or fragmented peptide using techniques such as 
isoelectric focusing. These polypeptide or fragmented peptide markers serve 
particularly well for the estimation of apparent isoelectric points of unknown proteins 
that have apparent isoelectric points close to that of the polypeptide or fragmented 
peptide markers of the invention. 
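
A pI calculation from a pKa table, as mentioned above, can be sketched as follows. The pKa values below are one illustrative textbook-style set chosen as an assumption; published tables differ, and the estimate shifts accordingly. The function names are hypothetical. The code computes the net charge at a given pH with the Henderson-Hasselbalch relation and finds the pH of zero net charge by bisection.

```python
# Illustrative pKa values (an assumption; other published tables differ).
POSITIVE = {"K": 10.5, "R": 12.4, "H": 6.0}
NEGATIVE = {"D": 3.86, "E": 4.25, "C": 8.33, "Y": 10.0}
N_TERM, C_TERM = 9.69, 2.34


def net_charge(sequence: str, ph: float) -> float:
    """Net charge of an unmodified peptide at the given pH."""
    charge = 1 / (1 + 10 ** (ph - N_TERM)) - 1 / (1 + 10 ** (C_TERM - ph))
    for residue in sequence:
        if residue in POSITIVE:
            charge += 1 / (1 + 10 ** (ph - POSITIVE[residue]))
        elif residue in NEGATIVE:
            charge -= 1 / (1 + 10 ** (NEGATIVE[residue] - ph))
    return charge


def estimate_pi(sequence: str, tol: float = 1e-4) -> float:
    """Bisect on pH 0-14 for the point where the net charge crosses zero."""
    low, high = 0.0, 14.0
    while high - low > tol:
        mid = (low + high) / 2
        if net_charge(sequence, mid) > 0:
            low = mid   # still positively charged: pI lies at higher pH
        else:
            high = mid
    return (low + high) / 2


# Hypothetical toy peptide.
print(round(estimate_pi("ACDEFGHIK"), 2))
```

Bisection works here because the net charge decreases monotonically as the pH rises. Glycosylation and other modifications are not modeled, matching the restriction above to unglycosylated polypeptides.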

The technique of isoelectric focusing can be further combined with other 
techniques such as gel electrophoresis to simultaneously separate a protein on the 
basis of molecular weight and charge. The ability to simultaneously resolve these 
polypeptide or fragmented peptide markers and the unknown protein under identical 
conditions allows for increased accuracy in the determination of the apparent 
isoelectric point of the unknown protein. This is of particular interest in techniques, 
such as two dimensional electrophoresis (T.D. Brock and M.T. Madigan, Biology of 
Microorganisms 76-77, Prentice Hall, 6th ed. (1991)), where the nature of the 
procedure dictates that any markers should be resolved simultaneously with the 
unknown protein. In addition, with such methods, these polypeptides and fragmented 
peptides thereof can assist in the determination of both the isoelectric point and 
molecular weight of an unknown protein or fragmented peptide. 



molecular weight markers of the invention can be visualized using antibodies 

detection is performed under conventional conditions that do not result in the 
detection of the unknown protein. It is understood that it may not be possible to 
generate antibodies against all polypeptide fragments of the invention, since small 
peptides may not contain immunogenic epitopes. It is further understood that not all 
antibodies will work in this assay; however, those antibodies which are able to bind 
polypeptides and fragments of the invention can be readily determined using 
conventional techniques. 

The unknown protein is also visualized by using a conventional staining 
procedure. The molar excess of unknown protein to polypeptide or fragmented 
peptide molecular weight markers of the invention is such that the conventional 
staining procedure predominantly detects the unknown protein. The level of these 
polypeptide or fragmented peptide molecular weight markers is such as to allow little 
or no detection of these markers by the conventional staining method. The preferred 
molar excess of unknown protein to polypeptide molecular weight markers of the 
invention is between 2 and 100,000 fold. More preferably, the preferred molar excess 
of unknown protein to these polypeptide molecular weight markers is between 10 and 
10,000 fold and especially between 100 and 1,000 fold. 

It is understood of course that many techniques can be used for the 
determination and detection of molecular weight and isoelectric point of an unknown 
protein, polypeptides, and fragmented peptides thereof using these polypeptide 
molecular weight markers and peptide fragments thereof and that these embodiments 
in no way limit the scope of the invention. 

In another embodiment, the analysis of the progressive fragmentation of the 
polypeptides of the invention into specific peptides (D. W. Cleveland et al., J. Biol. 
Chem. 252:1102-1106 (1977)), such as by altering the time or temperature of the 
fragmentation reaction, can be used as a control for the extent of cleavage of an 
unknown protein. For example, cleavage of the same amount of polypeptide and 

ID NOs:4-6 and 12-16 with cyanogen bromide in the absence of glycosylation 
generates a unique set of fragmented peptide molecular weight markers with 
molecular weights as set forth in Table 1 on the following page. 

Table 1. Molecular Weights of Peptide Fragments Generated by Cyanogen Bromide 

determining molecular weight over the range of the molecular weights of the 
fragment. 

In addition, the preferred purified polypeptides of the invention (SEQ ID 
NOs:4-6 and 12-16) have calculated molecular weights of approximately 4,199; 
13,938; 55,209; 86,983; 89,459; 92,781; 88,923; and 87,990 Daltons, respectively. 
Thus, where an intact protein is used, the use of these polypeptide molecular weight 
markers allows increased accuracy in the determination of apparent molecular weight 
of proteins that have apparent molecular weights close to these weights. 
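
A calculated molecular weight of the kind quoted above can be obtained by summing average residue masses and adding one water molecule for the free termini. The sketch below assumes an unmodified, unglycosylated chain written in one-letter code; the function name is hypothetical, and the residue masses are standard average values rather than figures from this specification.

```python
# Average residue masses in Daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528


def average_mass(sequence: str) -> float:
    """Average molecular weight of a linear, unmodified peptide."""
    if not sequence:
        return 0.0
    return sum(RESIDUE_MASS[residue] for residue in sequence) + WATER


# Glycylglycine as a simple sanity check.
print(round(average_mass("GG"), 2))
```

Post-translational modifications such as glycosylation would add to this value, which is why the weights above are described as calculated and approximate.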

Finally, as to the kits that are encompassed by the invention, the constituents 
of such kits can be varied, but typically contain the polypeptide and fragmented 
peptide molecular weight markers. Also, such kits can contain the polypeptides 
wherein a site necessary for fragmentation has been removed. Furthermore, the kits 
can contain reagents for the specific cleavage of the polypeptide and the unknown 
protein by chemical or enzymatic cleavage. Kits can further contain antibodies 
directed against polypeptides or fragments thereof of the invention. 

Identification of Unknown Proteins 

As set forth above, a polypeptide or peptide fingerprint can be entered into or 
compared to a database of known proteins to assist in the identification of the 
unknown protein using mass spectrometry (W.J. Henzel et al., Proc. Natl. Acad. Sci. 
USA, 90:5011-5015 (1993); D. Fenyo et al., Electrophoresis, 19:998-1005 (1998)). A 
variety of computer software programs to facilitate these comparisons are accessible 
via the Internet, such as Protein Prospector (Internet site: prospector.uscf.edu), 
MultiIdent (Internet site: www.expasy.ch/sprot/multiident.html), PeptideSearch 
(Internet site: www.mann.embl-heiedelberg.de...deSearch/FR_PeptideSearchForm.html), 
and ProFound (Internet site: www.chait-sgi.rockefeller.edu/cgi-bin/prot-id-frag.html). 
These programs allow the user to specify the cleavage agent and 

In addition, a polypeptide or peptide digest can be sequenced using tandem 
mass spectrometry (MS/MS) and the resulting sequence searched against databases 
(J.K. Eng et al., J. Am. Soc. Mass Spec., 5:976-989 (1994); M. Mann and M. Wilm, 
Anal. Chem., 66:4390-4399 (1994); J.A. Taylor and R.S. Johnson, Rapid Comm. Mass 
Spec., 11:1067-1075 (1997)). Searching programs that can be used in this process 
exist on the Internet, such as Lutefisk 97 (Internet site: 
www.lsbc.com:70/Lutefisk97.html), and the Protein Prospector, PeptideSearch and 
ProFound programs described above. 

Therefore, adding the sequence of a gene and its predicted protein sequence 
and peptide fragments to a sequence database can aid in the identification of unknown 
proteins using mass spectrometry. 
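
The database comparison described above can be reduced to a simple idea: count how many observed peptide masses fall within a tolerance of the theoretical peptide masses of each database entry, then rank the entries. The sketch below is a deliberately simplified illustration and does not reproduce the scoring used by any of the programs named above; the function names, the mass tolerance, and the two database entries are hypothetical.

```python
def count_matches(observed, theoretical, tolerance=0.5):
    """Number of observed masses within the tolerance of any theoretical mass."""
    return sum(
        any(abs(obs - theo) <= tolerance for theo in theoretical)
        for obs in observed
    )


def rank_candidates(observed, database, tolerance=0.5):
    """Rank database entries by the number of matched peptide masses."""
    scores = {
        name: count_matches(observed, masses, tolerance)
        for name, masses in database.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical theoretical digests for two made-up database entries.
database = {
    "protA": [500.3, 1200.6, 2300.1],
    "protB": [510.0, 1200.7],
}
print(rank_candidates([500.2, 1200.5, 2300.0], database))
```

Adding the predicted peptide masses of a newly sequenced gene to such a database is what allows an unknown protein with a matching fingerprint to be identified.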



Antibodies that are immunoreactive with the polypeptides of the invention are 
provided herein. Such antibodies specifically bind to the polypeptides via the antigen- 
binding sites of the antibody (as opposed to non-specific binding). Thus, the 
polypeptides, fragments, variants, fusion proteins, etc., as set forth above may be 
employed as "immunogens" in producing antibodies immunoreactive therewith. More 
specifically, the polypeptides, fragments, variants, fusion proteins, etc. contain 
antigenic determinants or epitopes that elicit the formation of antibodies. 

These antigenic determinants or epitopes can be either linear or 
conformational (discontinuous). Linear epitopes are composed of a single section of 
amino acids of the polypeptide, while conformational or discontinuous epitopes are 
composed of amino acid sections from different regions of the polypeptide chain that 
are brought into close proximity upon protein folding (C. A. Janeway, Jr. and P. 
Travers, Immuno Biology 3:9, Garland Publishing Inc., 2nd ed. (1996)). Because 
folded proteins have complex surfaces, the number of epitopes available is quite 

Thus, one aspect of the present invention relates to the antigenic epitopes of 
the polypeptides of the invention. Such epitopes are useful for raising antibodies, in 
particular monoclonal antibodies, as described in more detail below. Additionally, 
epitopes from the polypeptides of the invention can be used as research reagents, in 
assays, and to purify specific binding antibodies from substances such as polyclonal 
sera or supernatants from cultured hybridomas. Such epitopes or variants thereof can 
be produced using techniques well known in the art such as solid-phase synthesis, 
chemical or enzymatic cleavage of a polypeptide, or using recombinant DNA 
technology. 

As to the antibodies that can be elicited by the epitopes of the polypeptides of 
the invention, whether the epitopes have been isolated or remain part of the 
polypeptides, both polyclonal and monoclonal antibodies may be prepared by 
conventional techniques. See, for example, Monoclonal Antibodies, Hybridomas: A 
New Dimension in Biological Analyses, Kennet et al. (eds.), Plenum Press, New York 
(1980); and Antibodies: A Laboratory Manual, Harlow and Lane (eds.), Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY, (1988). 

Hybridoma cell lines that produce monoclonal antibodies specific for the 
polypeptides of the invention are also contemplated herein. Such hybridomas may be 
produced and identified by conventional techniques. One method for producing such 
a hybridoma cell line comprises immunizing an animal with a polypeptide; harvesting 
spleen cells from the immunized animal; fusing said spleen cells to a myeloma cell 
line, thereby generating hybridoma cells; and identifying a hybridoma cell line that 
produces a monoclonal antibody that binds the polypeptide. The monoclonal 
antibodies may be recovered by conventional techniques. 

The monoclonal antibodies of the present invention include chimeric 
antibodies, e.g., humanized versions of murine monoclonal antibodies. Such 
humanized antibodies may be prepared by known techniques and offer the advantage 

region fragment (lacking the antigen-binding site) derived from a human antibody. 
Procedures for the production of chimeric and further engineered monoclonal 
antibodies include those described in Riechmann et al., Nature, 332:323 (1988); Liu et 
al., PNAS, 84:3439 (1987); Larrick et al., Bio/Technology, 7:934 (1989), and Winter 
and Harris, TIPS, 14:139 (May 1993). Procedures to generate antibodies 
transgenically can be found in GB 2,272,440, US Patent Nos. 5,569,825 and 
5,545,806 and related patents claiming priority therefrom, all of which are 
incorporated by reference herein. 

Antigen-binding fragments of the antibodies, which may be produced by 
conventional techniques, are also encompassed by the present invention. Examples of 
such fragments include, but are not limited to, Fab and F(ab')2 fragments. Antibody 
fragments and derivatives produced by genetic engineering techniques are also 
provided. 

In one embodiment, the antibodies are specific for the polypeptides of the 
present invention and do not cross-react with other proteins. Screening procedures by 
which such antibodies may be identified are well known, and may involve 
immunoaffinity chromatography, for example. 

Uses Thereof 

The antibodies of the invention can be used in assays to detect the presence of 
the polypeptides or fragments of the invention, either in vitro or in vivo. The 
antibodies also may be employed in purifying polypeptides or fragments of the 
invention by immunoaffinity chromatography. 

Those antibodies that additionally can block binding of the polypeptides of the 
invention to the binding partner may be used to inhibit a biological activity that results 
from such binding. Such blocking antibodies may be identified using any suitable 
assay procedure, such as by testing antibodies for the ability to inhibit binding of 

Such an antibody may be employed in an in vitro procedure, or administered 
in vivo to inhibit a biological activity mediated by the entity that generated the 
antibody. Disorders caused or exacerbated (directly or indirectly) by the interaction of 
SVPH with cell surface binding partner thus may be treated. A therapeutic method 
involves in vivo administration of a blocking antibody to a mammal in an amount 
effective in inhibiting an SVPH-binding partner-mediated biological activity. 
Monoclonal antibodies are generally preferred for use in such therapeutic methods. In 
one embodiment, an antigen-binding antibody fragment is employed. 

Antibodies may be screened for agonistic (i.e., ligand-mimicking) properties. 
Such antibodies, upon binding to cell surface binding partner, induce biological 
effects (e.g., transduction of biological signals) similar to the biological effects 
induced when SVPH binds to cell surface binding partner. 

Compositions comprising an antibody that is directed against SVPH or SVPH 
binding partner, and a physiologically acceptable diluent, excipient, or carrier, are 
provided herein. Suitable components of such compositions are as described above 
for compositions containing SVPH or SVPH binding partner proteins. 

Also provided herein are conjugates comprising a detectable (e.g., diagnostic) 
or therapeutic agent, attached to the antibody. 

The following examples are provided to further illustrate particular 
embodiments of the invention, and are not to be construed as limiting the scope of the 
present invention. 

EXAMPLE 1: Isolation of SVPH Nucleic Acids 

A search of the GenBank DNA sequence database revealed two ESTs that 
share homology with ADAM20 and ADAM21. X85598 showed similarity to the 
Cys-rich region of ADAM20, while AI214466 showed similarity to the same region 
in ADAM21. Both ESTs were derived from testis mRNA 

labeled deoxyoligonucleotides (5'-CACCTAAGGTGTTCAATTCTTTG-3' (SEQ ID 
NO:17), 5'-CAAATACTGCAAGTGAGACTTGC-3' (SEQ ID NO:18), 
5'-TGCACAACTACGTGTGGTGTACCC-3' (SEQ ID NO:19), and 
5'-GAGCCACTGCAATTGAAAAAGTGCCC-3' (SEQ ID NO:20)). SVPH-4 clones 
were isolated under the same conditions using 32P-labeled deoxyoligonucleotides 
(AATGATGCTCTTGCATGGTCG (SEQ ID NO:21), 
CTTTGAGGGAGGCCATGTAG n GCAG (SEQ ID NO:22), and 
TGAAGGAGAAAACGCGCAGATGTCGG (SEQ ID NO:23)). DNAs from 
positively hybridizing phages were purified and characterized by restriction 
endonuclease mapping, Southern blot analysis, and DNA sequencing. 



EXAMPLE 2: DNA Sequence Analysis of SVPH 

SVPH-1c has an open reading frame of 820 amino acids (GenBank accession 
number AF171929) that encodes all of the ADAMs domains, including a signal 
sequence, pro-domain with a Cys switch, catalytic domain with a zinc-binding motif 
and a Met-turn, disintegrin domain, cysteine-rich domain, a transmembrane domain, 
and a cytoplasmic domain. However, SVPH-1c (as well as SVPH-1a and SVPH-1b) 
has a His residue (His 333) instead of a Glu residue in the zinc-binding motif that may 
affect catalytic activity. The Glu residue binds a water molecule via hydrogen bonding 
and is required for enzymatic activity (Stocker, W. et al., Protein Sci., 4:823-840 
(1995)). SVPH-1a and SVPH-1b represent alternative forms of SVPH-1c with 
differences in the cytoplasmic domain. SVPH-1a has a deletion of 54 amino acids 
resulting in a protein of 760 amino acids (GenBank accession number AF171930), 
while SVPH-1b has a divergent 38 amino acid C-terminus resulting in a protein with 
787 amino acids (GenBank accession number AF171931). These three forms of 
SVPH-1 encode cytoplasmic domains of 121, 67, and 88 amino acids, respectively. 
An unusual feature of the cytoplasmic domain of SVPH-1c is the sequence 

catalytic domain. One cDNA, presumably from an alternative RNA splicing event, 
deletes nine amino acids in the cytoplasmic domain and has been designated 
SVPH-4b (GenBank accession number AF171933). Interestingly, SVPH-4a and SVPH-4b 
contain a repeat sequence, QEFSK(T/A)KTG (SEQ ID NO:33), in the cytoplasmic 
domain, which was not found in GenBank. 

As noted above, SVPH-1a, SVPH-1b, and SVPH-1c diverge from the 
consensus zinc-binding cluster (HEXXHXXGXXHD) (SEQ ID NO:31) in the 
catalytic domain with a Glu to His change at position 333. To analyze these proteins 
further, DNA and protein sequence multiple alignments of all known mammalian 
ADAMs (http://www.med.virginia.edu/~jag6n/adams.html) were produced using the 
PILEUP program from the Wisconsin Package (Wisconsin Package 10.1, Genetics 
Computer Group, Madison, WI). Protein multiple alignments were generated using 
the modified PAM scoring matrix of Gribskov and Burgess (Gribskov, M. et al., 
Nucleic Acids Res., 14:6745-6763 (1986)) provided in the Wisconsin Package, with 
gap-open and gap-extend penalties of 30 and 1, respectively. Nucleic acid multiple 
alignments were generated using a scoring matrix with A, C, G, T matches scoring 
unity, mismatches scoring zero, and gap-open and gap-extend penalties of 5 and 1 
respectively. Unrooted maximum parsimony trees were estimated by the Wisconsin 
Package implementation of PAUP (version 4.0), starting from multiple alignments 
produced by PILEUP. PAUP parameters were set to use accelerated transformation 
character-state optimization with unordered, equally weighted characters. 
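
The nucleic acid scoring scheme described above (matches scoring unity, mismatches scoring zero, gap-open penalty 5, gap-extend penalty 1) can be illustrated by scoring a single gapped pairwise alignment. This sketch is not the PILEUP implementation. It assumes that a gap of length n costs the gap-open penalty plus n times the gap-extend penalty, which is one common convention and should be checked against the program documentation; the function name and the example alignments are hypothetical.

```python
def score_alignment(a, b, match=1.0, mismatch=0.0, gap_open=5.0, gap_extend=1.0):
    """Score two aligned sequences of equal length, with '-' marking gaps."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have the same length")
    score = 0.0
    previous_gap_row = None  # which sequence carried a gap in the last column
    for x, y in zip(a, b):
        if x == "-" or y == "-":
            gap_row = "a" if x == "-" else "b"
            if gap_row != previous_gap_row:
                score -= gap_open      # a new gap starts in this sequence
            score -= gap_extend        # every gapped column pays the extension
            previous_gap_row = gap_row
        else:
            score += match if x == y else mismatch
            previous_gap_row = None
    return score


# Hypothetical alignments: one two-column gap versus two separate gaps.
print(score_alignment("AC--GT", "ACTTGT"))
print(score_alignment("ACGT-A", "AC-TTA"))
```

With these penalties a single long gap is much cheaper than several short ones, which is the behavior an affine gap penalty is meant to produce.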

This alignment was used to infer a maximum parsimony phylogeny (Fig. 2). 
Due to the large number of taxa involved, the phylogeny was inferred using a heuristic 
tree search, which does not perform an exhaustive search of all possible tree 
topologies. Examination of the phylogenetic tree revealed an interesting pattern with 
respect to the presence of a zinc-binding motif. The ADAM sequences can be divided 
into two well-separated regions of the phylogeny, as marked by the arrow in Fig. 2. 

(ADAMs 4, (k 11, 22, 23, and SVPH-1) presumably arose from a catalytically 

example, ADAM4, ADAM7, and SVPH-1 all possess the three His residues and the 
Asp after the third conserved His. Finally, the corresponding region in ADAMs 2, 3, 
5, 18 and 27 is quite distinct. As these sequences form clusters quite divergent from 
the zinc-binding site-containing ADAMs, it is most likely that the zinc-binding site 
arose once in the common ancestor to the ADAMs and was lost in those lineages 
which do not possess a zinc-binding site (denoted by an 'X' in Fig. 2). 
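
The presence or absence of the consensus zinc-binding cluster HEXXHXXGXXHD discussed above can be checked with a regular expression, where X is any residue. The sketch below also flags the variant in which the Glu at the second position is replaced by another residue, as described above for SVPH-1 (Glu to His). The function name and the test sequences are hypothetical and are not taken from the sequence listing.

```python
import re

# X in the consensus is any amino acid, written as '.' in the pattern.
CONSENSUS = re.compile(r"HE..H..G..HD")
# Same spacing of His, Gly, and Asp, but with the Glu replaced.
GLU_REPLACED = re.compile(r"H[^E]..H..G..HD")


def classify_zinc_motifs(sequence: str):
    """Return (1-based position, matched text, label) for each motif found."""
    hits = []
    for m in CONSENSUS.finditer(sequence):
        hits.append((m.start() + 1, m.group(), "consensus"))
    for m in GLU_REPLACED.finditer(sequence):
        hits.append((m.start() + 1, m.group(), "Glu replaced"))
    return sorted(hits)


# Hypothetical toy sequences.
print(classify_zinc_motifs("AAAHEAAHAAGAAHDAAA"))
print(classify_zinc_motifs("AAAHHAAHAAGAAHDAAA"))
```

A sequence that returns no hits lacks both forms of the motif. Note that `finditer` reports non-overlapping matches only.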



EXAMPLE 3: Chromosome Mapping of SVPH 

Radiation hybrid mapping (Walter et al., Nat. Genet., 7:22-28 (1994)) was 
done using the GeneBridge 4 radiation-hybrid mapping panel (Research Genetics, 
Huntsville, AL). The panel was screened with specific primer pairs for SVPH-1 
(sense: 5'-TCGATAATGCATGAAGGCAACCCACC-3' (SEQ ID NO:24) and 
antisense: 5'-CAAGTCTCACTTGCAGTATTTGCGCC-3' (SEQ ID NO:25)), and 
SVPH-4 (sense: 5'-GCCACTGCATGTATGGGTG-3' (SEQ ID NO:26) and antisense: 
5'-GACACTCTTTGCTTTGGGTCG-3' (SEQ ID NO:27)), which generated products 
of 298 and 263 bp, respectively. PCR products were subjected to Southern blot 
analysis using an internal oligonucleotide probe specific for each gene. Data from 
two independent PCR screenings for each primer pair were scored against STS 
markers from the Whitehead Institute/MIT Center for Genome Research database 
using the statistical program RHMAPPER. LOD scores were >3.0 in all cases. 

SVPH-1a, SVPH-1b, and SVPH-1c were mapped to chromosome 4q34, 1.51 
cR distal from AFM312WG1. The sequential order of known markers relative to 
SVPH-1 on the Whitehead framework map was D4S1545, PDGH 
(Hydroxyprostaglandin Dehydrogenase 15), SVPH-1, WI-21773, GPM6A (Glycoprotein 
M6A). This region is syntenic to mouse chromosome 8. SVPH-4a and SVPH-4b 
were mapped to chromosome 1p11-13, 1.65 cR distal to D1S453. The sequential 
order of markers relative to SVPH-4 on the Whitehead framework map was CD2 

EXAMPLE 4: Tissue Distribution of SVPH 

Northern blot analysis was used to determine the tissue distribution of SVPH-1 
and SVPH-4. Northern blots were purchased from Clontech (catalog numbers 7760-1, 
7759-1, 7755-1, 7750-1). Each lane contained approximately 2 µg of the indicated 
poly A+ RNA. The blots were treated with Stark's buffer (50% formamide, 50 mM 
KPO4, 5X SSC, 1% SDS, 5X Denhardt's, 0.05% sarcosyl, 300mg/ml salmon sperm 
DNA) at 63°C for at least 1 h and then probed with 32P-labeled riboprobes in Stark's 
buffer at 63°C overnight (Cosman et al., Nature, 312:768-771 (1984)). Blots were 
then sequentially washed to high stringency (0.1X SSC, 0.1% SDS, 63°C) and 
exposed to film. Films were developed in an automated x-ray film processor. 
SVPH-1 (nt 1068 to 1786 of SEQ ID NOs:7-9) and SVPH-4 (nt 1343 to 1779 of SEQ ID 
NOs:10-11) anti-sense riboprobes were prepared by in vitro transcription from a T7 
RNA promoter with a commercially available kit (MAXIscript, Ambion, Inc., Austin, 
TX) using [α-32P]-UTP as the labeled nucleotide. 

As indicated in Figure 1, both SVPH-1 and SVPH-4 were specifically 
expressed in testes with a single mRNA species of approximately 3.0 kb. No signals 
were detected in the other RNA samples. 

EXAMPLE 5: Monoclonal Antibodies 

This example illustrates a method for preparing monoclonal antibodies that 
bind an SVPH-1, SVPH-1a, SVPH-1b, SVPH-1c, SVPH-4, SVPH-4a or SVPH-4b 
polypeptide. Suitable immunogens that may be employed in generating such 
antibodies include, but are not limited to, purified SVPH-1, SVPH-1a, SVPH-1b, 
SVPH-1c, SVPH-4, SVPH-4a or SVPH-4b polypeptide or an immunogenic fragment 
thereof such as the extracellular domain, or fusion proteins containing SVPH-1, 
SVPH-1a, SVPH-1b, SVPH-1c, SVPH-4, SVPH-4a or SVPH-4b (e.g., a soluble 
SVPH-1/Fc fusion protein). 

n . ■ * ■ ; * ' ic^'PM 1 QVnn 1 v^'PM 11, <\-nn 1 ^vpn } <^voij 1 » ,>>■ 



Briefly, mice are immunized with SVPH-1, SVPH-1a, SVPH-1b, SVPH-1c, SVPH-4, 

injected in amounts ranging from 10-100 µg subcutaneously or intraperitoneally. Ten 
to twelve days later, the immunized animals are boosted with additional immunogen 
emulsified in incomplete Freund's adjuvant. Mice are periodically boosted thereafter 
on a weekly to bi-weekly immunization schedule. Serum samples are periodically 
taken by retro-orbital bleeding or tail-tip excision to test for SVPH-1, SVPH-1a, 
SVPH-1b, SVPH-1c, SVPH-4, SVPH-4a or SVPH-4b antibodies by dot blot assay, 
ELISA (Enzyme-Linked Immunosorbent Assay) or inhibition of binding partner 
binding. 

Following detection of an appropriate antibody titer, positive animals are 
provided one last intravenous injection of SVPH-1, SVPH-1a, SVPH-1b, SVPH-1c, 
SVPH-4, SVPH-4a or SVPH-4b in saline. Three to four days later, the animals are 
sacrificed, spleen cells harvested, and spleen cells are fused to a murine myeloma cell 
line, e.g., NS1 or preferably P3x63Ag8.653 (ATCC CRL 1580). Fusions generate 
hybridoma cells, which are plated in multiple microtiter plates in a HAT 
(hypoxanthine, aminopterin and thymidine) selective medium to inhibit proliferation 
of non-fused cells, myeloma hybrids, and spleen cell hybrids. 

The hybridoma cells are screened by ELISA for reactivity against purified 
SVPH-1, SVPH-1a, SVPH-1b, SVPH-1c, SVPH-4, SVPH-4a or SVPH-4b by 
adaptations of the techniques disclosed in Engvall et al., Immunochem. 8:871, 1971 
and in U.S. Patent 4,703,004. A preferred screening technique is the antibody capture 
technique described in Beckmann et al., J. Immunol. 144:4212, (1990). Positive 
hybridoma cells can be injected intraperitoneally into syngeneic BALB/c mice to 
produce ascites containing high concentrations of anti-SVPH-1, SVPH-1a, SVPH-1b, 
SVPH-1c, SVPH-4, SVPH-4a or SVPH-4b monoclonal antibodies. Alternatively, 
hybridoma cells can be grown in vitro in flasks or roller bottles by various techniques. 
Monoclonal antibodies produced in mouse ascites can be purified by ammonium 
sulfate precipitation, followed by gel exclusion chromatography. Alternatively, 

EXAMPLE 6: Binding Assay 



Full length SVPH-1, SVPH-1a, SVPH-1b, SVPH-1c, SVPH-4, SVPH-4a or 
SVPH-4b is expressed and tested for the ability to bind its binding partner. The 
binding assay is conducted as follows. 

A fusion protein comprising a leucine zipper peptide fused to the N-terminus 
of a soluble binding partner polypeptide (LZ-binding partner) is employed in the 
assay. An expression construct is prepared, essentially as described for preparation of 
the FLAG®-binding partner expression construct in Wiley et al., Immunity, 3:673- 
682, (1995), which is hereby incorporated by reference, except that DNA encoding the 
FLAG® peptide is replaced with a sequence encoding a modified leucine zipper that 
allows for trimerization. The construct, in expression vector pDC409, encodes a 
leader sequence derived from human cytomegalovirus, followed by the leucine zipper 
moiety fused to the N-terminus of a soluble binding partner polypeptide. The LZ- 
binding partner is expressed in CHO cells, and can be purified from the culture 
supernatant. 

The expression vector designated pDC409 is a mammalian expression vector 
derived from the pDC406 vector described in McMahan et al., EMBO J. 10:2821- 
2832, (1991), which is hereby incorporated by reference. Features added to pDC409 
(compared to pDC406) include additional unique restriction sites in the multiple 
cloning site (mcs); three stop codons (one in each reading frame) positioned 
downstream of the mcs; and a T7 polymerase promoter, downstream of the mcs, that 
facilitates sequencing of DNA inserted into the mcs. 

For expression of full length human SVPH-1, SVPH-1a, SVPH-1b, SVPH-1c, 
SVPH-4, SVPH-4a or SVPH-4b protein, the entire coding region (i.e., the DNA 
sequence presented in SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:7, SEQ ID NO:8, 
SEQ ID NO:9, SEQ ID NO:10, or SEQ ID NO:11) is amplified by polymerase chain 
reaction (PCR). The isolated and amplified DNA is inserted into the expression 



or SVPH-4b polypeptide, as discussed above. Cells are cultured in DMEM 
48 hours after transfection, cells are detached non-enzymatically and incubated with 
LZ-binding partner (5 mg/ml), a biotinylated anti-LZ monoclonal antibody (5 mg/ml), 
and phycoerythrin-conjugated streptavidin (1:400), before analysis by 
fluorescence-activated cell scanning (FACS). The cytometric analysis is conducted on a FACScan 
(Becton Dickinson, San Jose, CA). 

The cells expressing LZ-binding partner will show significantly enhanced 
binding of SVPH-1, SVPH-1a, SVPH-1b, SVPH-1c, SVPH-4, SVPH-4a or SVPH-4b, 
compared to the control cells not expressing LZ-binding partner. 

The specification is most thoroughly understood in light of the teachings of the 
references cited within the specification which are hereby incorporated by reference. 
The embodiments within the specification provide an illustration of embodiments of 
the invention and should not be construed to limit the scope of the invention. The 
skilled artisan readily recognizes that many other embodiments are encompassed by 
the invention. 
What is claimed is: 

1. An isolated SVPH nucleic acid molecule selected from the group 
consisting of: 

(a) the DNA sequence selected from the group consisting of SEQ ID 
NO:1, SEQ ID NO:7, SEQ ID NO:8, and SEQ ID NO:9; 

(b) an isolated nucleic acid molecule encoding an amino acid sequence 
comprising the sequence selected from the group consisting of SEQ ID NO:4, SEQ ID 
NO:12, SEQ ID NO:13, and SEQ ID NO:14; 

(c) an isolated nucleic acid molecule that hybridizes to either strand of 
a denatured, double-stranded DNA comprising the nucleic acid sequence of (a) or (b) 
under conditions of moderate stringency in 50% formamide and 6XSSC, at 42°C with 
washing conditions of 60°C, 0.5XSSC, 0.1% SDS; 

(d) an isolated nucleic acid molecule derived by in vitro mutagenesis 
from SEQ ID NO:1, SEQ ID NO:7, SEQ ID NO:8, and SEQ ID NO:9; 

(e) an isolated nucleic acid molecule degenerate from SEQ ID NO:1, 
SEQ ID NO:7, SEQ ID NO:8, and SEQ ID NO:9 as a result of the genetic code; and 

(f) an isolated nucleic acid molecule selected from the group consisting 
of human SVPH 1 DNA; an allelic variant of human SVPH 1 DNA; and a species 
homolog of SVPH 1 DNA. 
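
By way of illustration only, and not as part of the claimed subject matter, the notion of a molecule that is "degenerate ... as a result of the genetic code" can be sketched in a few lines of Python. The sketch assumes the standard genetic code; the helper names `translate` and `is_degenerate_variant` are hypothetical, and the twelve-nucleotide fragments are chosen only because they encode Met-Lys-Met-Leu, the opening residues of SEQ ID NO:12.

```python
from itertools import product

# Standard genetic code, codons enumerated with base order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; a trailing partial codon is ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def is_degenerate_variant(a: str, b: str) -> bool:
    """True when two different DNA sequences encode the same amino acids."""
    return a.upper() != b.upper() and translate(a) == translate(b)


if __name__ == "__main__":
    reference = "atgaagatgtta"   # Met Lys Met Leu
    variant = "atgaaaatgctg"     # same peptide, different Lys and Leu codons
    print(translate(reference), is_degenerate_variant(reference, variant))
```

Two sequences that pass this check differ at the nucleotide level yet specify an identical polypeptide, which is the relationship element (e) describes.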

2. The nucleic acid molecule of claim 1 selected from the group 
consisting of SEQ ID NO:1, SEQ ID NO:7, SEQ ID NO:8, and SEQ ID NO:9. 

3. A recombinant vector that directs the expression of the nucleic acid 
molecule of claim 1. 

4. An isolated polypeptide encoded by the nucleic acid molecule of claim 1. 

5. An isolated polypeptide according to claim 4 having a molecular 
weight selected from the group consisting of approximately 4,199; 86,983; 89,459; 



Isolated antibodies according to claim . wherein the antibodies are 
9. A host cell transfected or transduced with the vector of claim 3. 

10. A method for the production of SVPH 1 polypeptide comprising 
culturing a host cell of claim 9 under conditions promoting expression, and recovering 
the polypeptide from the culture medium. 

11. The method of claim 10, wherein the host cell is selected from the 
group consisting of bacterial cells, yeast cells, plant cells, and animal cells. 

12. The method of claim 10, wherein the host cell is a mammalian cell. 

13. An isolated polypeptide comprising an amino acid sequence selected 
from the group consisting of SEQ ID NO:4, SEQ ID NO: 12, SEQ ID NO: 13, and 
SEQ ID NO: 14. 

14. An oligomer comprising a polypeptide of claim 4. 

15. An isolated SVPH nucleic acid molecule selected from the group 
consisting of: 

(a) the DNA sequence selected from the group consisting of SEQ ID 
NO:3, SEQ ID NO:10, and SEQ ID NO:11; 

(b) an isolated nucleic acid molecule encoding an amino acid sequence 
comprising the sequence selected from the group consisting of SEQ ID NO:6, SEQ ID 
NO:15, and SEQ ID NO:16; 

(c) an isolated nucleic acid molecule that hybridizes to either strand of 
a denatured, double-stranded DNA comprising the nucleic acid sequence of (a) or (b) 
under conditions of moderate stringency in 50% formamide and 6XSSC, at 42°C with 
washing conditions of 60°C, 0.5XSSC, 0.1% SDS; 

(d) an isolated nucleic acid molecule derived by in vitro mutagenesis 
from SEQ ID NO:3, SEQ ID NO:10, and SEQ ID NO:11; 

(e) an isolated nucleic acid molecule degenerate from SEQ ID NO:3, 
SEQ ID NO:10, and SEQ ID NO:11 as a result of the genetic code; and 

(f) an isolated nucleic acid molecule selected from the group consisting 
of human SVPH 4 DNA; an allelic variant of human SVPH 4 DNA; and a species 

16. The nucleic acid molecule of claim 15 selected from the group 
consisting of SEQ ID NO:3, SEQ ID NO:10, and SEQ ID NO:11. 

17. A recombinant vector that directs the expression of the nucleic acid 
molecule of claim 15. 

18. An isolated polypeptide encoded by the nucleic acid molecule of claim 15. 

19. An isolated polypeptide according to claim 18 having a molecular 
weight selected from the group consisting of approximately 55,209; 88,923; and 
87,990 Daltons as determined by SDS-PAGE. 

20. An isolated polypeptide according to claim 18 in non-glycosylated form. 

21. Isolated antibodies that bind to a polypeptide of claim 18. 

22. Isolated antibodies according to claim 21, wherein the antibodies are 
monoclonal antibodies. 

23. A host cell transfected or transduced with the vector of claim 17. 

24. A method for the production of SVPH 4 polypeptide comprising 
culturing a host cell of claim 23 under conditions promoting expression, and 
recovering the polypeptide from the culture medium. 

25. The method of claim 24, wherein the host cell is selected from the 
group consisting of bacterial cells, yeast cells, plant cells, and animal cells. 

26. The method of claim 24, wherein the host cell is a mammalian cell. 

27. An isolated polypeptide comprising an amino acid sequence selected 
from the group consisting of SEQ ID NO:6, SEQ ID NO:15, and SEQ ID NO:16. 

28. An oligomer comprising a polypeptide of claim 18. 

29. An isolated SVPH nucleic acid molecule selected from the group 
consisting of: 

(a) the DNA sequence of SEQ ID NO:2; 

(b) an isolated nucleic acid molecule encoding an amino acid sequence 



a denatured, double-stranded DNA comprising the nucleic acid sequence of (a) or (b) 
under conditions of moderate stringency in 50% formamide and 6XSSC, at 42°C with 
washing conditions of 60°C, 0.5XSSC, 0.1% SDS; 

(d) an isolated nucleic acid molecule derived by in vitro mutagenesis 
from SEQ ID NO:2; 

(e) an isolated nucleic acid molecule degenerate from SEQ ID NO:2 as 
a result of the genetic code; and 

(f) an isolated nucleic acid molecule selected from the group consisting 
of human SVPH 3 DNA; an allelic variant of human SVPH 3 DNA; and a species 
homolog of SVPH 3 DNA. 

30. The nucleic acid molecule of claim 29, wherein the DNA sequence 
comprises SEQ ID NO:2. 

31. A recombinant vector that directs the expression of the nucleic acid 
molecule of claim 29. 

32. An isolated polypeptide encoded by the nucleic acid molecule of claim 29. 

33. An isolated polypeptide according to claim 32 having a molecular 
weight of approximately 13,938 Daltons as determined by SDS-PAGE. 

34. An isolated polypeptide according to claim 32 in non-glycosylated form. 

35. Isolated antibodies that bind to a polypeptide of claim 32. 

36. Isolated antibodies according to claim 35, wherein the antibodies are 
monoclonal antibodies. 

37. A host cell transfected or transduced with the vector of claim 31. 

38. A method for the production of SVPH 3 polypeptide comprising 
culturing a host cell of claim 37 under conditions promoting expression, and 
recovering the polypeptide from the culture medium. 

39. The method of claim 38, wherein the host cell is selected from the 
group consisting of bacterial cells, yeast cells, plant cells, and animal cells. 
SEQUENCE LISTING 

<110> Immunex Corporation 

<120> NEW METALLOPROTEINASE-DISINTEGRIN FAMILY MEMBERS: SVPH 
DNAS AND POLYPEPTIDES 



<140> 
<141> 



: b 0 . J I 1 y - _ L' u j u a 



<150> 60/116,670 

<151> 1999-01-21 

<150> 60/138,652 

<151> 1999-06-14 

<150> 60/155,798 
<151> 1999-09-27 

<160> 33 

<170> PatentIn Ver. 2.1 

<210> 1 
<211> 129 
<212> DNA 

<213> Homo sapiens 



<223> "n" at various positions throughout the sequence 
may be any nucleotide 

<400> 1 

atttttgata ccacagtgac caacacggtc acctaaggtg ttcaattctt tgtagcaagt 60 

ctcacttgca gtatttgcgc ctgcaccaaa aatcctccta cactgttcan ttgcggtcat 120 
gacangctc 129 
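
For readers checking the listing against the garbled pages that follow, the layout used for nucleotide entries (lower-case bases in groups of ten, sixty per line, each line ending with the running base count) can be regenerated mechanically. The following is a minimal sketch in Python; the function name `format_sequence_listing` and its parameters are hypothetical and are not part of PatentIn or of this application.

```python
def format_sequence_listing(seq: str, group: int = 10, per_line: int = 60) -> str:
    """Lay out a nucleotide sequence in blocks of `group` bases,
    `per_line` bases per line, each line followed by the cumulative count."""
    # Drop any whitespace already present and normalise to lower case.
    seq = "".join(seq.split()).lower()
    lines = []
    for start in range(0, len(seq), per_line):
        chunk = seq[start:start + per_line]
        groups = [chunk[i:i + group] for i in range(0, len(chunk), group)]
        lines.append(f"{' '.join(groups)} {start + len(chunk)}")
    return "\n".join(lines)


if __name__ == "__main__":
    # SEQ ID NO:1 as printed above ("n" may be any nucleotide).
    seq_id_1 = (
        "atttttgataccacagtgaccaacacggtcacctaaggtgttcaattctttgtagcaagt"
        "ctcacttgcagtatttgcgcctgcaccaaaaatcctcctacactgttcanttgcggtcat"
        "gacangctc"
    )
    print(format_sequence_listing(seq_id_1))
```

Applied to SEQ ID NO:1, the sketch reproduces the three lines shown above, ending in the counts 60, 120 and 129, which agrees with the declared length in the `<211>` field.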



<210> 2 
<211> 469 
<212> DNA 
<213> Homo sapiens 

<400> 2 
c tt t tgac 



aqaacaqat r 



caacccagga ggacatgtga g 



gttt. tag:, aaaactti'c. 
qtcacaat.ac cc tttaatcc 



ctggaact 1 1 
gctt ctt cac 

catggcaggc: 
z gagatggca 
rcttcrcca- 



ctgcctcctg taaacgatgr. gcgggtggta 

gggttcaatg aaaaagtctc cargtggr.ag 

a c t g agar? c r. g c c g t. c c c 
G a g g g g g g . : ■. 1 g a g g a L i g v ' 

a a t a t a g r ~ ^ 1 1: g g a a a g 



a a a g u a c u. a a 
a ca gg t tggc 
ccctccctca 
z z ggaaaaa t 
::ac:gtaq a 
>: GG : 1 Gagg 
g : c " G oaag 



acagat t c t * 
t cc: tggt t : 
accagrgga' 
ccagt cagt ; 

accgtgcca: 
v f. c • \g j t 
1 1 a a a a a a c ( 



120 
180 
240 
<212> DNA 

<213> Homo sapiens 



<400> 3 

gcatgt:c:c 

■:c:gctaccr 

cactgccaan 

aattgcag:t 

atcccaggac 

gaatgtgat: 

aagttgcaa - 

cc stctggat 

gggaatccaa 

gaagqccgrt 

ggacctgatg 

caatttggta 

tcaatatgtg 

acgactataa 

carctatcca 

rg ryyay aag 

gactgtttgc 

tgcatgtatg 

agt gggcc:c 

at aatgttrc 

at aggaaacc 

caggaagaat 

gaat ctgaag 

aaaacaaaca 



tatatcttca 

tagaatatgc 

ggtctgccca 

gtaggggtag 

atatctctt t 

taggt t atgt 

gtggtt :cac 

caggtgccaa 

acgtgtgtag 

gttcrtgccc 

gtttcaggaa 

ccatggaggc 

actgtgagat 

gcaggctaca 

tttctactca 

tgaaacccat 

gccggg t atg 

ctgagaaatg 

ggtgggcacc 

caggactgct 

gcctt atttt 

actt aaaacc 

ctaaaacaaa 

caaaaactgg 

t tgaaagtaa 



aagaaaatcit 

-agatcagrg 

tgagctgggt 

gcctaatcgc 

tt t t aaacat 

get taagaga 

agaggagtgt 

ctgtagcat t 

gcaggaagga 

aaatgacgt t 

ggggtgcaga 

rcctagtgag 

tacaggaatt 

gtgtataaat 

tt tacaggca 

gggaa t acct 

tttiaaa?aa 

caatacccgg 

tccar tctgt 

cagaggggcg 

attaatcct: 

caaacaggaa 

aactgtacag 

acaggaagaa 

acgacccaaa 



aatgatgetc 

agt act 1 1 ac 

catgetgtag 

atcatgggct 

atetcttegg 

tgtggaaaca 

cagaaagatc 

ggactttget 

aatgaatgtg 

tataagcagg 

tccagatata 

tgctatgatg 

cgaaatttt a 

gttgaaacca 

gaaaatct ca 

gacc t aggt a 

^ar tgegtea 

ggtgt ttgea 

gaggaagtgg 

attccctcgt 

tcagtggttt 

aaaatgecac 

gaagaatc t a 

t ctaaagcaa 

gcaaagagtg 



t tgcatggt c 
t agat acaaa 
gaatgtcaca 
caggacgcac 
gagcaacatg 
aaa t tgtgga 
ggtgttgcca 
gtcatgat tg 
accttgeaga 
at ggaacc :c 
t gcagtgc t'a 
cagt taactt 
aaaagtgt ga 
t ccctgatt t 
tgtgctgggg 
tgataaat ga 
atagctcaqt 
acaacagaaa 
ggtatggagg 
caat t tgggt 
ttgtgttttt 
tatccaaagc 
aaacaaaaac 
aaactggaca 
t caagaaaca 



at 1 1 ggaaaa 

tatccttgee 

tgatgaacaa 

tgggtttagc 

t ctaaataat 

ggacaatgag 

atcaaat tgt 

tcggtttcgt 

gtactgegae 

t tgcaagtat 

aagcat 1 1 1 1 

aat aggtgat 

aagtgcaaa t 

gecagagcat 

cacaggctat 

tggcacct cc 

ectqeagt tt 

aaactgccac 

aagcattgac 

tgtgtccatc 

ccggcaagtg 

aaaaactgaa 

tggacaggaa 

ggaagaatct 

aaaaaagtaa 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



<210> 4 
<211> 40 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> "Xaa" at various positions throughout the sequence 
may be any amino acid 

<400> 4 

Met Thr Ala Xaa Glu Gln Cys Arg Arg Ile Phe Gly Ala Gly Ala As 



' r \ i" Lei : G 1 \ ' Asp Arg 



Thr A, a Ser Giu Thr Oyo ;yi 2y:= 
2 0 



e Lys As:: 



WO 00/43525 






PCT/US00/01338 



<400> 5 

Glu Asp Trp Val Tyr Tyr Arg Ile Ser His Glu Glu Lys Asp Leu Phe 
1 5 10 15 

Phe Asn Leu Thr Val Asn Glu Gly Phe Leu Ser Asn Ser Tyr Ile Met 
20 25 30 

Glu Lys Arg Tyr Gly Asn Leu Ser His Val Lys Met Met Ala Ser Ser 
35 40 45 

Ala Pro Leu Cys His Leu Ser Gly Thr Val Leu Gln Gln Gly Thr Arg 
50 55 60 

Val Gly Thr Ala Ala Leu Ser Ala Cys His Gly Leu Thr Gly Phe Phe 
65 70 75 80 

Gln Leu Pro His Gly Asp Phe Phe Ile Glu Pro Val Lys Lys His Pro 
85 90 95 

Leu Val Glu Gly Gly Tyr His Pro His Ile Val Tyr Arg Arg Gln Lys 



Val Pro Glu Thr Lys Glu Pro Thr Cys Gly Leu 
115 120 



<210> 6 
<211> 499 
<212> PRT 

<213> Homo sapiens 
<400> 6 

His Glu Asp Leu Tyr Leu Gln Arg Lys Tyr Asn Asp Ala Leu Ala Trp 
1 5 10 15 

Ser Phe Gly Lys Val Cys Ser Leu Glu Tyr Ala Gly Ser Val Ser Thr 
20 25 30 

Leu Leu Asp Thr Asn Ile Leu Ala Pro Ala Thr Trp Ser Ala His Glu 
35 40 45 

Leu Gly His Ala Val Gly Met Ser His Asp Glu Gln Tyr Cys Gln Cys 



Arg Gly Arg Pro Asn Cys Ile Met Gly Ser Gly Arg Thr Gly Phe Ser 
65 70 75 80 

Asn Cys Ser Tyr Ile Ser Phe Phe Lys His Ile Ser Ser Gly Ala Thr 
85 90 95 

Cys Leu Asn Asn Ile Pro Gly Leu Gly Tyr Val Leu Lys Arg Cys Gly 
100 105 110 
Gly Ala Asn Cys Ser Ile Gly Leu Cys Cys His Asp Cys Arg Phe Arg 
145 150 155 160 

Pro Ser Gly Tyr Val Cys Arg Gln Glu Gly Asn Glu Cys Asp Leu Ala 
165 170 175 

Glu Tyr Cys Asp Gly Asn Ser Ser Ser Cys Pro Asn Asp Val Tyr Lys 
180 185 190 

Gln Asp Gly Thr Pro Cys Lys Tyr Glu Gly Arg Cys Phe Arg Lys Gly 
195 200 205 

Cys Arg Ser Arg Tyr Met Gln Cys Gln Ser Ile Phe Gly Pro Asp Ala 
210 215 220 

Met Glu Ala Pro Ser Glu Cys Tyr Asp Ala Val Asn Leu Ile Gly Asp 
225 230 235 240 

Gln Phe Gly Asn Cys Glu Ile Thr Gly Ile Arg Asn Phe Lys Lys Cys 



Glu Ser Ala Asn Ser Ile Cys Gly Arg Leu Gln Cys Ile Asn Val Glu 
260 265 270 

Thr Ile Pro Asp Leu Pro Glu His Thr Thr Ile Ile Ser Thr His Leu 
275 280 285 

Gln Ala Glu Asn Leu Met Cys Trp Gly Thr Gly Tyr His Leu Ser Met 
290 295 300 

Lys Pro Met Gly Ile Pro Asp Leu Gly Met Ile Asn Asp Gly Thr Ser 
305 310 315 320 

Cys Gly Glu Gly Arg Val Cys Phe Lys Lys Asn Cys Val Asn Ser Ser 
325 330 335 

Val Leu Gln Phe Asp Cys Leu Pro Glu Lys Cys Asn Thr Arg Gly Val 
340 345 350 

Cys Asn Asn Arg Lys Asn Cys His Cys Met Tyr Gly Trp Ala Pro Pro 
355 360 365 

Phe Cys Glu Glu Val Gly Tyr Gly Gly Ser Ile Asp Ser Gly Pro Pro 



Gly Leu Leu Arg Gly Ala Ile Pro Ser Ser Ile Trp Val Val Ser Ile 
385 390 395 400 

Ile Met Phe Arg Leu Ile Leu Leu Ile Leu Ser Val Val Phe Val Phe 
405 410 415 



Phe Arg Gln Val Ile Gly Asn His Leu Lys Pro Lys Gln Glu Lys Met 
Val Gln Glu Glu Ser Lys Thr Lys Thr Gly Gln Glu Glu Ser Glu Ala 
450 455 460 

Lys Thr Gly Gln Glu Glu Ser Lys Ala Lys Thr Gly Gln Glu Glu Ser 
465 470 475 480 

Lys Ala Asn Ile Glu Ser Lys Arg Pro Lys Ala Lys Ser Val Lys Lys 
485 490 495 

Gln Lys Lys 



<210> 7 

<211> 2301 

<212> DNA 

<213> Homo sapiens 



<400> 7 

atgaagatgt 

caggargagc 

gg caccacca 

cagaaacaca 

t tcacctaca 

tactat catg 

gggggttttc 

ttttctacca 

accatgagat 

gataatccca 

gaaat tgtag 

t tgctggagg 

gt taaggtgt 

gatgcaagga 

atgcaacatg 

ggagctttta 

aaaactttgg 

aaccatgatg 

ccaccaataa 

aggacaaagt 

gggaatggtg 

aaagat ccct 

c 1 1 tgttgca 

gaatgtgatc 

gt ggaagatg 

cgcaatgaac 

t acaaagaat 

tatataaagt 

gaaattccca 

t gctggagt a 

ggaacagagt 

ttgaatagta 

cactgc:att 

agtgttgaca 



tactcctgct 

gaggcatgac 
t tatccaca: 
cagaccaggg 
gt tatgtgga 
aaggaatat: 
cgtttgaaca 
ccggattta: 
ctcagaagca 
tcgtcattga 
atctatatg: 
tattatttgg 
aatctgtgca 
acacctcaca 
gaggaatgtg 
gcactttttc 
aggatacatg 
ctaaatttag 
gtctgcttg a 
ttgttgaaga 
gctgtctgtc 
aagactgca a 
1 1 ccagagt g 
gaattccctg 
agtgcagg ag 
tgaacacc t c 
gtaatatctc 
atatgagt ga 
ctgattacca 
gtgggataga 
at tget ca cc 
g caa::.at :T 

qtqqCCCarc 



gcattgcct t 
tcacagccct 
acctccaggc 
aaaggtcaag 
tgctatcctn 
aggggacccu 
acagataaat 
:ctggtatac 
gcaaaatgaa 
aagttctta: 
caattatcrg 
tattg:taa: 
tctggagatc 
cctgtattgc 
ccttttcaca 
tacaccacac 
aattgcagtg 
t cgttgttca 
c aattgtagt 
aacagtaca c 
aggagaagag 
aaattgcac: 
gt ccctacca 
gtgcaat ggr 
taagg agag g 
qait tttqqt 



t 1 1 1 



ag gt g ac 
aqarat c 



:gt 
: aq 



tcatactart 
1 1 tggggat g 
t c a t a t a t g c 

t qcat 1 1 tgt 
gt ggqacc ct 
cccraaaaqa 



ggggtgtttc 

cc 99" 
tggc 
aagcttttgt 
gaggac cage 
gaatccctgg 
ga :tt tgect 
aagatggaca 
ataacatgc c 
gtgggctggt 
tacattegtt 
atagtggatt 
tggaccaata 
aagtggaagt 
actctaggat 
cgtagttgtg 
gctca t cat c 
caacctagat 
tatggtgat t 
acaaaggaca 
tgtgactgtg 
ccga ttgatg 
t ragggaaag 
actccc cats 
gqct actgtt 
geagqegcaa 
gttggtcact 
t g t g g a a g a a 
gtgcattggg 
aagggacttg 
a t cc acagqc 
aac aagaggg 
cc ca actgee 
aaqaagaaaa 



tgtcctgt tc 
t gat t cctgt 
atatcctgcc 
t ttccaaaca 
cat t tgtcca 
t ttccctcag 
atgaaatcaa 
gtgaggagaa 
gaatggaat t 
ggat ccat t t 
atgaaaggaa 
ccat t ttgga 
a a a a r c t c a t 
eggagaacat 
taagagggt t 
caattgt ta c 
taggtcat aa 
gc ataatgea 
1 1 tgggaata 
tctttaatgt 
g a c c 1 1 1 a a a 
gtt ttacttg 
t gtgtagaaa 
agtg cccag a 
at g aaaaga g 
at artqeaaq 
g t g g t a t z a a 
t tcaqtgtq.a 
ct cget tea a 
at at tggtga 
actgtgccca 
geatcegsaa 
tqataaaagg 
agaagt t z t q 



- , v — -j v — j ci o. w ci- w- 


t5 fi 




110 




! ^ 1 1 1 Qqaqo c 


18 0 


1 cr^c t q 


24 0 


y da LQu 1 — ~~ 


3 0 0 


L. Ly L L. U ^. 


3 0 0 


g 7 c c c t a g c a 


4 2 0 


d w CJ. d L- L_ I— L_ — '-A 


4 8 0 


tgaagaaat t 


54 0 


taggat tgt t 


6 0 0 


cgactcaaag 


6 6 0 


tgtcattggt 


71C 


cgtagtagat 


7 80 


tacgccccgg 


840 


aagtggcata 


9 00 


t t tcatgaac 


96 0 


tttgggcatg 


102 0 


tgaaggcaac 


10 6 0 


tac tgtagag 


114 0 


gaagcgctgt 


12 0 0 


gca t tgtgc a 


1 2 6 r 


tgcttttggg 




ggaggtcaat 


1 :• P. 0 


tga tt tttat 


14 4 (J 


z tgt catgac 


1 5 ».» 0 


t qa q act tg z 


15 


aaatgetaja 




qaatgtga :a 




tgacataat g 


174 0 


agtgaaagat 


1*00 


tataacca: c 


18 6 0 


caataaacat 


1 ?2 0 


ct at qgaggt 


1 j 8 0 


r.tatctgt gt 


2 0 4 -j 
i 0 ..' 



<210> 8 
<211> 2364 
<212> DNA 

<213> Homo sapiens 

<400> 8 

atgaagarg: ca:tcctgct gcartgcctt ggggtgttte tgtcctgttc tggacaca:c 60 

■-aggatgaq: ac:cccaata tcacagccc: ccggacgtgg tgactcctgt gaggaraac: 12 

ggcaccacca gaggcatgac acccccaggc tggctctc'jt ata::ctgcc rtttggagg: 18 

cagaaacaca ttatccaca: aaaggtcaag aagcttttgt tctccaaaca cctccctgtg 24 

ttcacctaca cagaccaggg tgctatcctr gaggaccagc ca::rgtcca gaataactgc 

tactatcatg gttatgtgga aggggaccca gaatc-ctgg ttrccctcag tacctgttnt 3 1 ■ ' 

gggggtttitr aaggaatatt acagataaat gactttgctt atgaaatcaa gcccctagc a 42" 

ctttctacca cgttcgaaca rctggnatac aagatgga:a gtgaggagaa acaactttca 4r; ( .' 

accatgaga^ ccggatttat gcaaaatgaa ataacatg:c gaatggaatt :gaagaaa:: 540 

gataattcca ctragaagca aagttcttat gtgggctggn ggatccattt :aggattgr: 6 0u 

gaaattgtag tcgtcattga caattatctg tacattcgrr atgaaaggaa rgactcaaag 600 

"gctggagg atctatatgt tattgttaat atagtgga:: ccattttgga tgtcattgg: 72u 

gt-taaggtgt rartatttgg tttggagatc tggaccaa:a aaaacctcat cgtagtaga: 760 

gatgtaagga aacctgtgca cctgtattgc aagcggaagt cggagaacat cacgccccgg 840 

atgcaacaug acac^Lcdcd Lc^tttcaca acrccaggat "aagagggtt: aagtiggcatia 9 0c 

ggagctttta gaggaatgtg tacaccacac cgzag-tgtg caa::gt:ac tttcatgaac 960 

aaaactttgg gcaccttttc aattgcagtg gctcatcatc :agg:caraa tttgggcatg 1020 

aaccatgatg aggatacatg tcgttgttca caacccagat gcataatgca tgaaggcaac 1080 

ccaccaataa ctaaatttag caattgtagt tatggtgatt tttgggaata tactgtagag 114 0 

aggacaaagt gtntgcttga aacagcacac acaaaggaca tctttaatgt gaagcgctgc 1200 

ggg a ^tg9-9 ttgtcgaaga aggagaagag tgtgactgtg gacccttaaa gcattgtgca 1260 

aaagatcccr: gctgr-Ctgrc aaattgcact ctgactgatg gttctacttg tgcttttggg 1320 

ctttgttgca aagactgcaa gttcctacca tcagggaaag tgtgtagaaa ggaggccaa: 1380 

gaatgtgacc ttccagagcg gtgcaatggt acttcccata agtgcccaga cgactttta: 1440 

gtggaaga^g gaatrccctg taaggagagg ggctactgct atgaaaagag ctgtcatgac 1500 

cgcaatgaac agngnaggag gattttnggt gcaggcgcaa acacrgcaag tgagacttg: 1560 

tacaaagaat tgaacaccrt aggtgaccgt gttggtcact gtggtatcaa aaatgctaca 1620 

tatataaagt gtaatatc:c agatgtccag tgtggaagaa tncagtgtiga gaatgtgaca 1680 

gaaattccca atatgagtga tcatactact gtgcattggg ctcgcttcaa tgacataatg 1740 

tgctggagta ctgattacca cttggggatg aagggacctg atactggtga agtgaaagat 1800 

ggaacagagt gtgggataga ccatatatgc atccacaggc actgtigtcca tataaccatc 1860 

ttgaatagta attgctcacc cgcattttgt aacaagaggg gcat:tgcaa caataaacat 1920 

cactgccatt gcaattatrt gtgggaccct cccaactgcc :gataaaagg ctatggaggt 198 

agtgttgaca gtggccca:c ccctaagaga aagaagaaaa agaagttctg ttatctgtgt 204 

atattgttgc ttatcgtt:t get tatttta ttatgttgtc rttancgact ttgtaaaaaa 210 

agtaaaccaa t.aaaaaag:a gcaagatgtt caaactccat cr.gcaaaaga agaggaaaaa 2 1'". 

attcagcg:: ga:c:catga gr.tacctccc jagagrcas: cttgggtgat gccttcccag 2220 

agtcaacctc ctgtgacgcc r^cccagagt catcctca^g tgatgccttc ccagagtcau 2230 

:ctcctcaaa attt^ttcr" gt; rcagct.tr :caa:cagtg actgtgtgct aaattttagg 2 24 0' 

rtactgtatc ::caggccac ctga 2 3 64 



<210> 9 

<211> 2463 

<212> DNA 

<213> Homo sapiens 
ttcacctaca 
tactatcatg 

gggggtzttc 

ttttctacca 
accargaga: 
gataactcca 
gaaattgtag 
ttgctggagg 
gttaaggtgt 
gatgtaagga 
atgcaacarg 
ggagctttta 
aaaactttgg 
aaccacgatg 
ccaccaataa 
aggacaaagt 
gggaatggtg 
aaagatccct 
ctttgttgca 
gaargtgatc 
gtggaagatg 
cgcaatgaac 
tacaaagaat 
tat at aaagt 
gaaattccca 
tgctggagta 
ggaacagagt 
ttgaatagta 
cactgccat t 
agtgttgaca 
at at tgttgc 
agtaaaccaa 
at tcagcgtc 
agtcaacctc 
cctcctgtga 
gtgatgcct t 
ccctcccaga 
tag 



cagaccaggg 
gttatgr.gga 
aaggaatat t 
cgt t tgaaca 
ccggat t tat 
ctcagaag za 
t cgt c at t ga 
azctatat gt 
tattatttgg 
aat ctgtg :a 
acacctca :a 
gaggaatgtg 
gcact ttt t c 
aggataca tg 
ctaaattt ag 
gtttgct tga 
t tgttgaaga 
gctgtctgtc 
aagactg :aa 
ttccagagtg 
gaacccccig 
agtgtaggag 
tg aacacctt 
gtaatatctc 
atatgagtga 
ctgattacca 
gtgggataga 
attgctcacc 
gcaattat ct 
gtggcccacc 
ttattgtttt 
taaaaaagc a 
gacctcatga 
ctgtgacgcc 
tgccttccca 
cccagag:ca 
ggcaacct ca 



tgct at cc t t 
aggggaccca 
acagataaat 
t z tggtacae 
gcaaaatgaa 
aagttct tat 
taat t atct g 
tat tgt taat 
t ttggagat z 
cc tgt a ttgc 
tzttttcaza 
ta zaccacac 
aattgcagtg 
tcgt tgttca 
caat tgtagt 
aazagtaca z 
aggagaagag 
aaat tgcact 
gttcctacca 
gtgcaatggt 

LddUUdydJ'-J 

gat t t t tggt 
aggtgaccg: 
agatgtccag 
t catactact 
t.ztggggatg 
t cat atatgc 
tgcat t ttgt 
gtgggaccct 
ccctaagaga 
gtttatttta 
gcaagatgtt 
gt tacct c zz 
t t cccagagt 
gagtcat czt 
cc ztcagt tg 
gt tgatgcct 



gaggaecagz 
gaa: z ze t g g 
gact t tgct t 
a a gatggaca 
ataacatgc: 
gtg ggctggt 
tacatt cgt t 
at agtggat t 
tggaccaata 
aagtggaag t 
act ztaggat 
cgtagt tgtg 
gctcatcatc 
caac ctagat 
tatggtgatt 
acaaaggaca 
tgtqactgtg 
ztgactgatg 
tcagggaaa g 
acttcccat a 

gcaggcgcaa 
gt tggtcact 
tgtggaagaa 
gtgcat tggg 
aagggacct g 
atccacaggc 
aacaagaggg 
cccaactgc z 
aagaagaaaa 
ttatgt tgt c 
zaaact ccat 
cagagccaac 
catcctcggg 
cagttgacgc 
acgcczteec 
tcccagagti z 



c a c 1 1 g t c c a 
1 1 1 cczt cag 
atgaaatcaa 
gtgaggagaa 
gaatggaat t 
ggatczattt 
at gaa aggaa 
cca 1 1 1 tgga 
aaaacct cat 
cggagaacat 
taagagggtt 
caattgttac 
taggt zataa 
gcataatgca 
" ttgggaata 
tctttaargt 
gacctttaaa 
gtrctacttg 
tgtgt agaaa 
agtgcccaga 
atgaaaciy'ag 
atactgcaag 
gtggtatcaa 
t tcagtgtga 
ctcgcttcaa 
atattggtga 
actgtgtcca 
gcatctgcaa 
tgataaaagg 
agaagttctg 
nttatcgact 
ctgcaaaaga 
ct tgggtgat 
tgatgccttc 
cttcccagag 
agagtcaacc 
aacct cctgt 



^aataaetgz 
tazzt gtttt 
gc zee t ag z a 
acaattttza 
tgaagaaat z 
taggat tgtt 
cgactcaaag 
z g teat tggt 
t gtagtaga t 
tacgccccgg 
aagt gg cat a 
1 1 tcatgaa z 
tt tgggcatg 
t gaaggcaa c 
ta ztgt agag 
gaagcgctgt 
gzattgtgza 
tgcttttggg 
ggaggtcaat 
tgacttttat 
LLyLcatgac 
tgagacttgz 
aaatgetaca 
ga atgtgaca 
tgacataatg 
agtgaaagat 
tataaccatc 
caataaacat 
ctatggaggt 
ttatctgtgt 
ttgtaaaaaa 
agaggaaaaa 
gect t cccag 
tcagagtcaa 
tcaacctczt 
tcctgzga za 
gacgccctcc 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2463 



<210> 10 

<211> 2373 
<212> DNA 

<213> Homo sapiens 



<400> 10 



atgaggtcag 


t gcagat z r. t 


z ::ct cecaa 


tgccgtttgz 


tccttctact 


agt t zegae a 


t > 1 ■ 


atg ztcctta 


agtctct tgg 


cgaaga t gta 


atztttcacz 


ctgaagggga 


gtt tgactcg 


1 2 


tatgaagtca 


ccat t cctga 


gaagctgagc 


-teeggggag 


aggtgcaggg 


tgtggtcagt 


1 8 0 




acctactgca 


gt taaaaggc 


aagaagcacg 


tc ct ccat 1 1 


gtggcccaag 


24 0 


a g a c t. 1 c t g t 


tgccczgaca 


t ztgcgcgt t 




cagaacat gg 


gaaactgctg 


3 c ) '"« 


gaggatca: c 


r 1 1 a c a t a c z 


aaagga c t g z 


aactacatgg 


get ccgt gaa 


agagt ctctg 


3 6 I 


gactctaaaq 


ctactataa a 


cazatgcatg 


ggggqtctrc 


gaggtgtat t 


t a a cat tgat 
c 1 1 caagat g 
aaaatacgcg 
aaaagtgtat 
rataatgatg 
gtgagtactt 
ggccatgctg 
:gcatcargg 
:aratcr.ct; c 
agargtggaa 
tgtcagaaag 
at tggact t t 
ggaaatgaat 
gtttataage 
agatccagat 
gagtgctatg 
at t cgaaat t 
aatgt tgaa a 
gca gaaaat c 
cctgacctag 
aaa aat tgcg 
cggggtgttt 
tgtgaggaag 
gcgat tccc t 
rt t tcagtgg 
gaaaaaatgc 
caggaagaat 
gaatctaaag 
aaagcaaaga 



1 1 cgtatga g 
ttggatatc z 
taaatgctcg 
etc ttgcatg 
ta ctagatae 
taggaatgt z 
gctcaggacg 
egggagcaa e 
acaaaatr. gt 
atcggtgt tg 
gctgtcat ga 
gtgaccttgc 
aggatggaac 
atatgeagt g 
atgcagttaa 
t taaaaagt g 
c:atccccga 
t:atgtgcrg 
gtatgataaa 
tcaatagc: z 
gcaacdauag 
tggggtatgg 
cgtcaatttg 
tttttgtgtt 
cactatccaa 
ctaaaacaaa 
caaaaactgg 
gtgtcaagaa 



agagtt agct 
c c t g t c a t c a 
gt cgt t tgga 
a aat at cct t 
a e a t g a t g a a 
:act gggt t t 
at gt :taaat 
ggaggacaat 
■::aatcaaat 
t tgtcggttt 
agagt actgc 
c rettgeaag 
ccaaagcatt 
:t taataggt 
tgaaagtgca 
t ttgc ragag 
gggcacaggc 
tgatggcace 
agt ectgeag 

aggaagcat t 
ggttgtgtcc 
1 1 1 ccggcaa 
ageaaaaact 
aactggacag 
acaggaagaa 
a:aaaaaaag 



aaggcccctg 
gaagt 1 1 tag 
gat tgggcac 
aaagtgt gt t 
gcccctgcta 
caatactgc z 
agcaat tgc a 
aatatcccag 
gaggaatgtg 
tgtaagttgc 
cgtccatctg 
gaegggaat t 
tatgaaggee 
1 1 tggacct g 
gatcaatttg 
aattcaatat 
catacgacta 
t a teat eta t 
tcetgtggag 
tttgactgtt 
LdCigcatgc 
gacagtgggc 
atcataatgt 
gtgataggaa 
gaacaggaag 
gaagaat ctg 
tctaaagcaa 
t aa 



a a gt a t ggae 
gc agat t tgt 
atttatatct 
et etagaata 
ectggtctgc 
aatgtagggg 
gttatatcte 
gaetaggt ta 
attgtggtte 
aaccaggtg z 
gat acgtgtg 
caagt t cctg 
gttgtt t cag 
atgccatgga 
gtaactgtga 
gtggcaggct 
taatttct^e 
ccatgaaaee 
aaggccgggt 
tgcctgagaa 

ctccagga:t 
ttcgect tat 
accact taaa 
aatct aaaac 
aagcaaaaac 
acattgaaag 



agattttaac 
aatatataaa 
t caaagaaaa 
t gctggat ca 
t eatgaget g 
taggectaat 
tttttttaaa 
tgtgct taag 
ca eagaggag 
caactgtagc 
taggcaggaa 
cccaaatgac 
gaaggggtgc 
ggct cctagt 
gattacagga 
acagtgtata 
tcatttacag 
ca tgggaata 
atgt tt taaa 
a tgcaat a zc 

gc tcagaggg 
tt tattaatc 
acccaaacag 
aaaaactgta 
tggacaggaa 
taaacgaccc 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2373 



<210> 11 
<211> 2346 
<212> DNA 

<213> Homo sapiens 



<400> 11 
atgaggtcag 
atget cct t a 
tatgaagtca 
cccgtgtccc 
agacttctgt 
gaggat ca t 2 
gaet c t aaag 
gecaaacat t 
c t z rtgaaga 
tggcagatgg 
c acccaaagt 
aacaatctt t 
t t tcaagatg 
aaaatacgcg 
aaaagtgt a t 
rataatgatg 
qtqaatact t 



tgeagatett 
agtctcttgg 



acctactgca 
tgccccgaca 

cttacatac: 
ctactataag 
ac caaattg a 
aagagcag:. t 
c c ccttatga 
acttggaat t 
ct caagtcat 
ttegtatgag 
ttggatatcc 
t a a a t g c t c g 
etct tgcat g 
t act aaa t ac 



cctctcccaa 
cgaagatgt a 
gaagctgagt 
gt taaaaggc 
tetgegegtt 
aaaggactg z 
caca tg ratg 
gc:c ctcaag 
*: gggaa t ca g 
gaa ta aggcg 
gat cct act r 
acatgatgee 
gatacactta 
agagt t age t 
cct gr cat ca 
;jtcgt ttgga 
a a a t a t c c 1 1 



tgccgtttge 
atttt tcae z 
ttceggggag 
aagaagcacg 
ttctecttea 
aac t aca t gg 
gggggt zt cc 
gect r_ tec a 
gt 1 1 g- ggc t 
aggc taaggg 
1 1 1 g a t c a a a 
attct tttga 
a a gg c t c 1 1 g 
gaagt tttag 
gat tgggcac 
aaagtgtgtt 
qcccct acta 



tccttctact 
ctgaagggga 
aggtgcaggg 
tcctccattt 
cagaacatgg 
get ccgtgaa 
gaggtgtatt 
gttttgaaza 
t a agt ga t ga 
acttt ectgg 
gt aggt at ag 
ctgggattat 
aagt a tggac 
gcagat t tgt 
atttatatct 
ct ctagaat a 
cctggccrqc 



agttccgaca 60 
gtttgactcg 120 
tgtggtcagt 180 
gtggcccaag 240 



ggaactgctg 
agagt ct ctg 
taacattgat 
tgtegtctat 
t gaaat agaa 
at cctataaa 
gt t tgtgaac 
ggacacc t ac 
agat 1 1 taac 
aatatataaa 
tcaaagaaaa 
t gc tgga tea 
t ca t gage tg 



2 0 0 

3 60 

4 2 '. 

4 e,o 

r-4 ■ 
£ 



7 e o 

84 0 



002; 
a t tggac t 1 t 
ggaaatgaat 
gtttataagc 
agatccagat 
gagtgctatg 
att cgaaatt 
aatgt tgaaa 
gcagaaaa t c 
cc tgacct ag 
aaaaac:g:g 
cggggtgtt t 
tgtgaggaag 
gcgattccct 
ct t tcagtgg 
gaaaaaatgc 
caggaagaat 
gaat cnaaag 
aagtaa 



gctigtcanga 
gtgaccngc 
aggatggaa: 
ata:gcag:g 
atgcagttaa 
ttaaaaag:g 
ccatcccrga 
tcatgtgctg 
gtatgacaaa 
tcaatagc: c 
gcaacaacag 
tggggtatgg 
cgtcaatttg 
t t tttgtgt t 
cactatccaa 
ctaaaacaaa 
caaaca t tga 



ttgt cggtt t 
agagnaccgc 
c -ct tgcaag 
ccaaagcatt 
ct taataggt 
t gaaagtgca 
t t tgccagag 
gggcacaggc 
tgatggcacc 
agtcctgcag 
aaaaaactgc 
aggaagcat t 
ggt tgtgtcc 
1 1 1 ccggcaa 
agcaaaaact 
aactggacag 
aagt aaacga 



cgtccatctg 
gacgggaat t 
rargaaggcc 
tttggacctg 
gatcaatttg 
aattcaata: 
:atacgac:a 
tatcatctat 
tcctgtggag 
t t tgactgt t 
cactgcatgt 
gacagtgggc 
atcataatgt 
gtgat aggaa 
gaacaggaag 
gaagaatctg 
cccaaagcaa 



gat a cgtgtg 
caagt tccr g 
gttgtt tea g 
atgccatgg a 
gtaactgtga 
gtggcaggrt 
taatttcta r 
c catgaaa z r 
aaggccgggt 
tgcctgagaa 
atgggtgggc 
ct ccaggact 
ttcgecttat 
acc acttaaa 
aa t ct aaaac 
aagcaaaaac 
agagtgt caa 



t ag gcaggaa 
crraaatgac 

^ggggtgc 

ggctcctagt 
gattacagga 
acagtgtata 
tcatttacag 
catgggaata 
atgttttaaa 
atgeaatace 
acccccattc 
gctcagaggg 
1 1 tat taat c 
acccaaacag 
aaaaac tgt a 
tggacaggaa 
gaaacaaaaa 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2346 



<210> 12 

<211> 766 
<212> PRT 

<213> Homo sapiens 
<400> 12 

Met Lys Met Leu Leu Leu Leu His Cys Leu Gly Val Phe Leu Ser Cys 
1 5 10 15 

Ser Gly His Ile Gln Asp Glu His Pro Gln Tyr His Ser Pro Pro Asp 
20 25 30 

Val Val Ile Pro Val Arg Ile Thr Gly Thr Thr Arg Gly Met Thr Pro 
35 40 45 

Pro Gly Trp Leu Ser Tyr Ile Leu Pro Phe Gly Gly Gln Lys His Ile 
50 55 60 

Ile His Ile Lys Val Lys Lys Leu Leu Phe Ser Lys His Leu Pro Val 
65 70 75 80 

Phe Thr Tyr Thr Asp Gln Gly Ala Ile Leu Glu Asp Gln Pro Phe Val 
85 90 95 

Gln Asn Asn Cys Tyr Tyr His Gly Tyr Val Glu Gly Asp Pro Glu Ser 
100 105 110 

Leu Val Ser Leu Ser Thr Cys Phe Gly Gly Phe Gln Gly Ile Leu Gln 
115 120 125 

Ile Asn Asp Phe Ala Tyr Glu Ile Lys Pro Leu Ala Phe Ser Thr Thr 






Phe Glu Glu Ile Asp Asn Ser Thr Gln Lys Gln Ser Ser Tyr Val Gly
180 185 190

Trp Trp Ile His Phe Arg Ile Val Glu Ile Val Val Val Ile Asp Asn
195 200 205

Tyr Leu Tyr Ile Arg Tyr Glu Arg Asn Asp Ser Lys Leu Leu Glu Asp
210 215 220

Leu Tyr Val Ile Val Asn Ile Val Asp Ser Ile Leu Asp Val Ile Gly
225 230 235 240

Val Lys Val Leu Leu Phe Gly Leu Glu Ile Trp Thr Asn Lys Asn Leu
245 250 255

Ile Val Val Asp Asp Val Arg Lys Ser Val His Leu Tyr Cys Lys Trp
260 265 270

Lys Ser Glu Asn Ile Thr Pro Arg Met Gln His Asp Thr Ser His Leu
275 280 285

Phe Thr Thr Leu Gly Leu Arg Gly Leu Ser Gly Ile Gly Ala Phe Arg
290 295 300

Gly Met Cys Thr Pro His Arg Ser Cys Ala Ile Val Thr Phe Met Asn
305 310 315 320

Lys Thr Leu Gly Thr Phe Ser Ile Ala Val Ala His His Leu Gly His
325 330 335

Asn Leu Gly Met Asn His Asp Glu Asp Thr Cys Arg Cys Ser Gln Pro
340 345 350

Arg Cys Ile Met His Glu Gly Asn Pro Pro Ile Thr Lys Phe Ser Asn
355 360 365

Cys Ser Tyr Gly Asp Phe Trp Glu Tyr Thr Val Glu Arg Thr Lys Cys
370 375 380

Leu Leu Glu Thr Val His Thr Lys Asp Ile Phe Asn Val Lys Arg Cys
385 390 395 400

Gly Asn Gly Val Val Glu Glu Gly Glu Glu Cys Asp Cys Gly Pro Leu
405 410 415

Lys His Cys Ala Lys Asp Pro Cys Cys Leu Ser Asn Cys Thr Leu Thr
420 425 430



Asp Gly Ser Thr Cys Ala Phe Gly Leu Cys Cys Lys Asp Cys Lys Phe
435 440 445

Val Glu Asp Gly Ile Pro Cys Lys Glu Arg Gly Tyr Cys Tyr Glu Lys
485 490 495

Ser Cys His Asp Arg Asn Glu Gln Cys Arg Arg Ile Phe Gly Ala Gly
500 505 510

Ala Asn Thr Ala Ser Glu Thr Cys Tyr Lys Glu Leu Asn Thr Leu Gly
515 520 525

Asp Arg Val Gly His Cys Gly Ile Lys Asn Ala Thr Tyr Ile Lys Cys
530 535 540

Asn Ile Ser Asp Val Gln Cys Gly Arg Ile Gln Cys Glu Asn Val Thr
545 550 555 560

Glu Ile Pro Asn Met Ser Asp His Thr Thr Val His Trp Ala Arg Phe
565 570 575

Asn Asp Ile Met Cys Trp Ser Thr Asp Tyr His Leu Gly Met Lys Gly
580 585 590

Pro Asp Ile Gly Glu Val Lys Asp Gly Thr Glu Cys Gly Ile Asp His
595 600 605

Ile Cys Ile His Arg His Cys Val His Ile Thr Ile Leu Asn Ser Asn
610 615 620

Cys Ser Pro Ala Phe Cys Asn Lys Arg Gly Ile Cys Asn Asn Lys His
625 630 635 640

His Cys His Cys Asn Tyr Leu Trp Asp Pro Pro Asn Cys Leu Ile Lys
645 650 655

Gly Tyr Gly Gly Ser Val Asp Ser Gly Pro Pro Pro Lys Arg Lys Lys
660 665 670

Lys Lys Lys Phe Cys Tyr Leu Cys Ile Leu Leu Leu Ile Val Leu Phe
675 680 685

Ile Leu Leu Cys Cys Leu Tyr Arg Leu Cys Lys Lys Ser Lys Pro Ile
690 695 700

Lys Lys Gln Gln Asp Val Gln Thr Pro Ser Ala Lys Glu Glu Glu Lys
705 710 715 720

Ile Gln Arg Arg Pro His Glu Leu Pro Pro Gln Ser Gln Pro Trp Val
725 730 735

Met Pro Ser Gln Ser Gln Pro Pro Val Thr Pro Ser Gln Arg Gln Pro
740 745 750

Gln Leu Met Pro Ser Gln Ser Glu Pro Pro Val Thr Pro Ser




<213> Homo sapiens

<400> 13

Met Lys Met Leu Leu Leu Leu His Cys Leu Gly Val Phe Leu Ser Cys
1 5 10 15

Ser Gly His Ile Gln Asp Glu His Pro Gln Tyr His Ser Pro Pro Asp
20 25 30

Val Val Ile Pro Val Arg Ile Thr Gly Thr Thr Arg Gly Met Thr Pro
35 40 45

Pro Gly Trp Leu Ser Tyr Ile Leu Pro Phe Gly Gly Gln Lys His Ile
50 55 60

Ile His Ile Lys Val Lys Lys Leu Leu Phe Ser Lys His Leu Pro Val
65 70 75 80

Phe Thr Tyr Thr Asp Gln Gly Ala Ile Leu Glu Asp Gln Pro Phe Val
85 90 95

Gln Asn Asn Cys Tyr Tyr His Gly Tyr Val Glu Gly Asp Pro Glu Ser
100 105 110

Leu Val Ser Leu Ser Thr Cys Phe Gly Gly Phe Glu Gly Ile Leu Gln
115 120 125

Ile Asn Asp Phe Ala Tyr Glu Ile Lys Pro Leu Ala Phe Ser Thr Thr
130 135 140

Phe Glu His Leu Val Tyr Lys Met Asp Ser Glu Glu Lys Gln Phe Ser
145 150 155 160

Thr Met Arg Ser Gly Phe Met Gln Asn Glu Ile Thr Cys Arg Met Glu
165 170 175

Phe Glu Glu Ile Asp Asn Ser Thr Gln Lys Gln Ser Ser Tyr Val Gly
180 185 190

Trp Trp Ile His Phe Arg Ile Val Glu Ile Val Val Val Ile Asp Asn
195 200 205

Tyr Leu Tyr Ile Arg Tyr Gln Arg Asn Asp Ser Lys Leu Leu Glu Asp
210 215 220

Leu Tyr Val Ile Val Asn Ile Val Asp Ser Ile Leu Asp Val Ile Gly
225 230 235 240

Val Lys Val Leu Leu Phe Gly Leu Glu Ile Trp Thr Asn Lys Asn Leu
245 250 255

Ile Val Val Asp Asp Val Arg Lys Ser Val His Leu Tyr Cys Lys Trp

Phe Thr Thr Leu Gly Leu Arg Gly Leu Ser Gly Ile Gly Ala Phe Arg
290 295 300

Gly Met Cys Thr Pro His Arg Ser Cys Ala Ile Val Thr Phe Met Asn
305 310 315 320

Lys Thr Leu Gly Thr Phe Ser Ile Ala Val Ala His His Leu Gly His
325 330 335

Asn Leu Gly Met Asn His Asp Glu Asp Thr Cys Arg Cys Ser Gln Pro
340 345 350

Arg Cys Ile Met His Glu Gly Asn Pro Pro Ile Thr Lys Phe Ser Asn
355 360 365

Cys Ser Tyr Gly Asp Phe Trp Glu Tyr Thr Val Glu Arg Thr Lys Cys
370 375 380

Leu Leu Glu Thr Val His Thr Lys Asp Ile Phe Asn Val Lys Arg Cys
385 390 395 400

Gly Asn Gly Val Val Glu Glu Gly Glu Glu Cys Asp Cys Gly Pro Leu
405 410 415

Lys His Cys Ala Lys Asp Pro Cys Cys Leu Ser Asn Cys Thr Leu Thr
420 425 430

Asp Gly Ser Thr Cys Ala Phe Gly Leu Cys Cys Lys Asp Cys Lys Phe
435 440 445

Leu Pro Ser Gly Lys Val Cys Arg Lys Glu Val Asn Glu Cys Asp Leu
450 455 460

Pro Glu Trp Cys Asn Gly Thr Ser His Lys Cys Pro Asp Asp Phe Tyr
465 470 475 480

Val Glu Asp Gly Ile Pro Cys Lys Glu Arg Gly Tyr Cys Tyr Glu Lys
485 490 495

Ser Cys His Asp Arg Asn Glu Gln Cys Arg Arg Ile Phe Gly Ala Gly
500 505 510

Ala Asn Thr Ala Ser Glu Thr Cys Tyr Lys Glu Leu Asn Thr Leu Gly
515 520 525

Asp Arg Val Gly His Cys Gly Ile Lys Asn Ala Thr Tyr Ile Lys Cys

Asn Ile Ser Asp Val Gln Cys Gly Arg Ile Gln Cys Glu Asn Val Thr
545 550 555 560

Pro Asp Ile Gly Glu Val Lys Asp Gly Thr Glu Cys Gly Ile Asp His
595 600 605

Ile Cys Ile His Arg His Cys Val His Ile Thr Ile Leu Asn Ser Asn
610 615 620

Cys Ser Pro Ala Phe Cys Asn Lys Arg Gly Ile Cys Asn Asn Lys His
625 630 635 640

His Cys His Cys Asn Tyr Leu Trp Asp Pro Pro Asn Cys Leu Ile Lys
645 650 655

Gly Tyr Gly Gly Ser Val Asp Ser Gly Pro Pro Pro Lys Arg Lys Lys
660 665 670

Lys Lys Lys Phe Cys Tyr Leu Cys Ile Leu Leu Leu Ile Val Leu Phe
675 680 685

Ile Leu Leu Cys Cys Leu Tyr Arg Leu Cys Lys Lys Ser Lys Pro Ile
690 695 700

Lys Lys Gln Gln Asp Val Gln Thr Pro Ser Ala Lys Glu Glu Glu Lys
705 710 715 720

Ile Gln Arg Arg Pro His Glu Leu Pro Pro Gln Ser Gln Pro Trp Val
725 730 735

Met Pro Ser Gln Ser Gln Pro Pro Val Thr Pro Ser Gln Ser His Pro
740 745 750

Gln Val Met Pro Ser Gln Ser Gln Pro Pro Gln Asn Leu Phe Leu Phe
755 760 765

Ser Phe Ser Ile Ser Asp Cys Val Leu Asn Phe Arg Leu Leu Tyr Leu
770 775 780

Gln Ala Thr
785



<210> 14
<211> 820
<212> PRT
<213> Homo sapiens

<400> 14

Met Lys Met Leu Leu Leu Leu His Cys Leu Gly Val Phe Leu Ser Cys
1 5 10 15

Ser Gly His Ile Gln Asp Glu His Pro Gln Tyr His Ser Pro Pro Asp
20 25 30

Val Val Ile Pro Val Arg Ile Thr Gly Thr Thr Arg Gly Met Thr Pro

Ile His Ile Lys Val Lys Lys Leu Leu Phe Ser Lys His Leu Pro Val
65 70 75 80

Phe Thr Tyr Thr Asp Gln Gly Ala Ile Leu Glu Asp Gln Pro Phe Val
85 90 95

Gln Asn Asn Cys Tyr Tyr His Gly Tyr Val Glu Gly Asp Pro Glu Ser
100 105 110

Leu Val Ser Leu Ser Thr Cys Phe Gly Gly Phe Gln Gly Ile Leu Gln
115 120 125

Ile Asn Asp Phe Ala Tyr Glu Ile Lys Pro Leu Ala Phe Ser Thr Thr
130 135 140

Phe Glu His Leu Val Tyr Lys Met Asp Ser Glu Glu Lys Gln Phe Ser
145 150 155 160

Thr Met Arg Ser Gly Phe Met Gln Asn Glu Ile Thr Cys Arg Met Glu
165 170 175

Phe Glu Glu Ile Asp Asn Ser Thr Gln Lys Gln Ser Ser Tyr Val Gly
180 185 190

Trp Trp Ile His Phe Arg Ile Val Glu Ile Val Val Val Ile Asp Asn
195 200 205

Tyr Leu Tyr Ile Arg Tyr Glu Arg Asn Asp Ser Lys Leu Leu Glu Asp
210 215 220

Leu Tyr Val Ile Val Asn Ile Val Asp Ser Ile Leu Asp Val Ile Gly
225 230 235 240

Val Lys Val Leu Leu Phe Gly Leu Glu Ile Trp Thr Asn Lys Asn Leu
245 250 255

Ile Val Val Asp Asp Val Arg Lys Ser Val His Leu Tyr Cys Lys Trp
260 265 270

Lys Ser Glu Asn Ile Thr Pro Arg Met Gln His Asp Thr Ser His Leu
275 280 285

Phe Thr Thr Leu Gly Leu Arg Gly Leu Ser Gly Ile Gly Ala Phe Arg



Gly Met Cys Thr Pro His Arg Ser Cys Ala Ile Val Thr Phe Met Asn

Lys Thr Leu Gly Thr Phe Ser Ile Ala Val Ala His His Leu Gly His
325 330 335



Asn Leu Gly Met Asn His Asp Glu Asp Thr Cys Arg Cys Ser Gln Pro

Cys Ser Tyr Gly Asp Phe Trp Glu Tyr Thr Val Glu Arg Thr Lys Cys
370 375 380

Leu Leu Glu Thr Val His Thr Lys Asp Ile Phe Asn Val Lys Arg Cys
385 390 395 400

Gly Asn Gly Val Val Glu Glu Gly Glu Glu Cys Asp Cys Gly Pro Leu
405 410 415

Lys His Cys Ala Lys Asp Pro Cys Cys Leu Ser Asn Cys Thr Leu Thr
420 425 430

Asp Gly Ser Thr Cys Ala Phe Gly Leu Cys Cys Lys Asp Cys Lys Phe
435 440 445

Leu Pro Ser Gly Lys Val Cys Arg Lys Glu Val Asn Glu Cys Asp Leu
450 455 460

Pro Glu Trp Cys Asn Gly Thr Ser His Lys Cys Pro Asp Asp Phe Tyr
465 470 475 480

Val Glu Asp Gly Ile Pro Cys Lys Glu Arg Gly Tyr Cys Tyr Glu Lys
485 490 495

Ser Cys His Asp Arg Asn Glu Gln Cys Arg Arg Ile Phe Gly Ala Gly
500 505 510

Ala Asn Thr Ala Ser Glu Thr Cys Tyr Lys Glu Leu Asn Thr Leu Gly
515 520 525

Asp Arg Val Gly His Cys Gly Ile Lys Asn Ala Thr Tyr Ile Lys Cys
530 535 540

Asn Ile Ser Asp Val Gln Cys Gly Arg Ile Gln Cys Glu Asn Val Thr
545 550 555 560

Glu Ile Pro Asn Met Ser Asp His Thr Thr Val His Trp Ala Arg Phe
565 570 575

Asn Asp Ile Met Cys Trp Ser Thr Asp Tyr His Leu Gly Met Lys Gly
580 585 590

Pro Asp Ile Gly Glu Val Lys Asp Gly Thr Glu Cys Gly Ile Asp His
595 600 605

Ile Cys Ile His Arg His Cys Val His Ile Thr Ile Leu Asn Ser Asn
610 615 620

Cys Ser Pro Ala Phe Cys Asn Lys Arg Gly Ile Cys Asn Asn Lys His
625 630 635 640

His Cys His Cys Asn Tyr Leu Trp Asp Pro Pro Asn Cys Leu Ile Lys
645 650 655

Lys Lys Lys Phe Cys Tyr Leu Cys Ile Leu Leu Leu Ile Val Leu Phe
675 680 685

Ile Leu Leu Cys Lys Leu Tyr Arg Leu Cys Lys Lys Ser Lys Pro Ile
690 695 700

Lys Lys Gln Gln Asp Val Gln Thr Pro Ser Ala Lys Glu Glu Glu Lys
705 710 715 720

Ile Gln Arg Arg Pro His Glu Leu Pro Pro Gln Ser Gln Pro Trp Val

Met Pro Ser Gln Ser Gln Pro Pro Val Thr Pro Ser Gln Ser His Pro
740 745 750

Arg Val Met Pro Ser Gln Ser Gln Pro Pro Val Met Pro Ser Gln Ser
755 760 765

His Pro Gln Leu Thr Pro Ser Gln Ser Glu Pro Pro Val Met Pro Ser
770 775 780

Gln Ser His Pro Gln Leu Thr Pro Ser Gln Ser Gln Pro Pro Val Thr
785 790 795 800

Pro Ser Gln Arg Gln Pro Gln Leu Met Pro Ser Gln Ser Gln Pro Pro
805 810 815

Val Thr Pro Ser
820



<210> 15
<211> 790
<212> PRT
<213> Homo sapiens

<400> 15

Met Arg Ser Val Gln Ile Phe Leu Ser Glu Cys Arg Leu Leu Leu Leu
1 5 10 15

Leu Val Pro Thr Met Leu Leu Lys Ser Leu Gly Glu Asp Val Ile Phe
20 25 30



His Pro Glu Gly Glu Phe Asp Ser

Leu Ser Phe Arg Gly Glu Val Gln Gly Val Val Ser Pro Val Ser Tyr
50 55 60

Leu Leu Gln Leu Lys Gly Lys Lys His Val Leu His Leu Trp Pro Lys
65 70 75 80

Arg Leu Leu Leu Pro Arg His Leu Arg Val Phe Ser Phe Thr Glu His

Met Gly Ser Val Lys Glu Ser Leu Asp Ser Lys Ala Thr Ile Ser Thr
115 120 125

Cys Met Gly Gly Leu Arg Gly Val Phe Asn Ile Asp Ala Lys His Tyr
130 135 140

Gln Ile Glu Pro Leu Lys Ala Ser Pro Ser Phe Glu His Val Val Tyr
145 150 155 160

Leu Leu Lys Lys Glu Gln Phe Gly Asn Gln Val Cys Gly Leu Ser Asp
165 170 175

Asp Glu Ile Glu Trp Gln Met Ala Pro Tyr Glu Asn Lys Ala Arg Leu
180 185 190

Arg Asp Phe Pro Gly Ser Tyr Lys His Pro Lys Tyr Leu Glu Leu Ile
195 200 205

Leu Leu Phe Asp Gln Ser Arg Tyr Arg Phe Val Asn Asn Asn Leu Ser
210 215 220

Gln Val Ile His Asp Ala Ile Leu Leu Thr Gly Ile Met Asp Thr Tyr
225 230 235 240

Phe Gln Asp Val Arg Met Arg Ile His Leu Lys Ala Leu Glu Val Trp
245 250 255

Thr Asp Phe Asn Lys Ile Arg Val Gly Tyr Pro Glu Leu Ala Glu Val
260 265 270

Leu Gly Arg Phe Val Ile Tyr Lys Lys Ser Val Leu Asn Ala Arg Leu
275 280 285

Ser Ser Asp Trp Ala His Leu Tyr Leu Gln Arg Lys Tyr Asn Asp Ala
290 295 300

Leu Ala Trp Ser Phe Gly Lys Val Cys Ser Leu Glu Tyr Ala Gly Ser
305 310 315 320

Val Ser Thr Leu Leu Asp Thr Asn Ile Leu Ala Pro Ala Thr Trp Ser
325 330 335

Ala His Glu Leu Gly His Ala Val Gly Met Ser His Asp Glu Gln Tyr
340 345 350

Cys Glu Cys Arg Gly Arg Pro Asn Cys Ile Met Gly Ser Gly Arg Thr
355 360 365

Gly Phe Ser Asn Cys Ser Tyr Ile Ser Phe Phe Lys His Ile Ser Ser
370 375 380

Gly Ala Thr Cys Leu Asn Asn Ile Pro Gly Leu Gly Tyr Val Leu Lys

Ser Thr Glu Glu Cys Gln Lys Asp Arg Cys Cys Gln Ser Asn Cys Lys
420 425 430

Leu Gln Pro Gly Ala Asn Cys Ser Ile Gly Leu Cys Cys His Asp Cys
435 440 445

Arg Phe Arg Pro Ser Gly Tyr Val Cys Arg Gln Glu Gly Asn Glu Cys
450 455 460

Asn Leu Ala Glu Tyr Cys Asp Gly Asn Ser Ser Ser Cys Pro Asn Asp
465 470 475 480

Val Tyr Lys Gln Asp Gly Thr Pro Cys Lys Tyr Glu Gly Arg Cys Phe
485 490 495

Arg Lys Gly Cys Arg Ser Arg Tyr Met Gln Cys Gln Ser Ile Phe Gly
500 505 510

Pro Asp Ala Met Glu Ala Pro Ser Glu Cys Tyr Asp Ala Val Asn Leu
515 520 525

Ile Gly Asp Gln Phe Gly Asn Cys Glu Ile Thr Gly Ile Arg Asn Phe
530 535 540

Lys Lys Cys Glu Ser Ala Asn Ser Ile Cys Gly Arg Leu Gln Cys Ile
545 550 555 560

Asn Val Glu Thr Ile Pro Asp Leu Pro Glu His Thr Thr Ile Ile Ser
565 570 575

Thr His Leu Gln Ala Glu Asn Leu Met Cys Trp Gly Thr Gly Tyr His
580 585 590

Leu Ser Met Lys Pro Met Gly Ile Pro Asp Leu Gly Met Ile Asn Asp
595 600 605

Gly Thr Ser Cys Gly Glu Gly Arg Val Cys Phe Lys Lys Asn Cys Val
610 615 620

Asn Ser Ser Val Leu Gln Phe Asp Cys Leu Pro Glu Lys Cys Asn Thr
625 630 635 640

Arg Gly Val Cys Asn Asn Arg Lys Asn Cys His Cys Met Tyr Gly Trp
645 650 655

Ala Pro Pro Phe Cys Glu Glu Val Gly Tyr Gly Gly Ser Ile Asp Ser
660 665 670

Gly Pro Pro Gly Leu Leu Arg Gly Ala Ile Pro Ser Ser Ile Trp Val
675 680 685

Val Ser Ile Ile Met Phe Arg Leu Ile Leu Leu Ile Leu Ser Val Val

Gln Lys Met Pro Leu Ser Lys Ala Lys Thr Glu Gln Glu Glu Ser Lys
725 730 735

Thr Lys Thr Val Gln Glu Glu Ser Lys Thr Lys Thr Gly Gln Glu Glu
740 745 750

Ser Glu Ala Lys Thr Gly Gln Glu Glu Ser Lys Ala Lys Thr Gly Gln
755 760 765

Glu Glu Ser Lys Ala Asn Ile Glu Ser Lys Arg Pro Lys Ala Lys Ser
770 775 780

Val Lys Lys Gln Lys Lys
785 790



<210> 16
<211> 781
<212> PRT
<213> Homo sapiens

<400> 16

Met Arg Ser Val Gln Ile Phe Leu Ser Gln Cys Arg Leu Leu Leu Leu
1 5 10 15

Leu Val Pro Thr Met Leu Leu Lys Ser Leu Gly Glu Asp Val Ile Phe
20 25 30

His Pro Glu Gly Glu Phe Asp Ser Tyr Glu Val Thr Ile Pro Glu Lys
35 40 45

Leu Ser Phe Arg Gly Glu Val Gln Gly Val Val Ser Pro Val Ser Tyr
50 55 60

Leu Leu Gln Leu Lys Gly Lys Lys His Val Leu His Leu Trp Pro Lys
65 70 75 80

Arg Leu Leu Leu Pro Arg His Leu Arg Val Phe Ser Phe Thr Glu His
85 90 95

Gly Glu Leu Leu Glu Asp His Pro Tyr Ile Pro Lys Asp Cys Asn Tyr
100 105 110

115 120 125

Cys Met Gly Gly Leu Arg Gly Val Phe Asn Ile Asp Ala Lys His Tyr
130 135 140

Gln Ile Glu Pro Leu Lys Ala Ser Pro Ser Phe Glu His Val Val Tyr
145 150 155 160

Arg Asp Phe Pro Gly Ser Tyr Lys His Pro Lys Tyr Leu Glu Leu Ile
195 200 205

Leu Leu Phe Asp Gln Ser Arg Tyr Arg Phe Val Asn Asn Asn Leu Ser
210 215 220

Gln Val Ile His Asp Ala Ile Leu Leu Thr Gly Ile Met Asp Thr Tyr
225 230 235 240

Phe Gln Asp Val Arg Met Arg Ile His Leu Lys Ala Leu Glu Val Trp
245 250 255

Thr Asp Phe Asn Lys Ile Arg Val Gly Tyr Pro Glu Leu Ala Glu Val
260 265 270

Leu Gly Arg Phe Val Ile Tyr Lys Lys Ser Val Leu Asn Ala Arg Leu
275 280 285

Ser Ser Asp Trp Ala His Leu Tyr Leu Gln Arg Lys Tyr Asn Asp Ala
290 295 300

Leu Ala Trp Ser Phe Gly Lys Val Cys Ser Leu Glu Tyr Ala Gly Ser
305 310 315 320

Val Ser Thr Leu Leu Asp Thr Asn Ile Leu Ala Pro Ala Thr Trp Pro
325 330 335

Ala His Glu Leu Gly His Ala Val Gly Met Ser His Asp Glu Gln Tyr
340 345 350

Cys Gln Cys Arg Gly Arg Leu Asn Cys Ile Met Gly Ser Gly Arg Thr
355 360 365

Gly Phe Ser Asn Cys Ser Tyr Ile Ser Phe Phe Lys His Ile Ser Ser
370 375 380

Gly Ala Thr Cys Leu Asn Asn Ile Pro Gly Leu Gly Tyr Val Leu Lys
385 390 395 400

Arg Cys Gly Asn Lys Ile Val Glu Asp Asn Glu Glu Cys Asp Cys Gly
405 410 415

Ser Thr Glu Glu Cys Gln Lys Asp Arg Cys Cys Gln Ser Asn Cys Lys
420 425 430

Leu Gln Pro Gly Ala Asn Cys Ser Ile Gly Leu Cys Cys His Asp Cys
435 440 445

Arg Phe Arg Pro Ser Gly Tyr Val Cys Arg Gln Glu Gly Asn Glu Cys
450 455 460

Asn Leu Ala Glu Tyr Cys Asp Gly Asn Ser Ser Ser Cys Pro Asn Asp
465 470 475 480

Arg Lys Gly Cys Arg Ser Arg Tyr Met Gln Cys Gln Ser Ile Phe Gly
500 505 510

Pro Asp Ala Met Glu Ala Pro Ser Glu Cys Tyr Asp Ala Val Asn Leu
515 520 525

Ile Gly Asp Gln Phe Gly Asn Cys Glu Ile Thr Gly Ile Arg Asn Phe
530 535 540

Lys Lys Cys Glu Ser Ala Asn Ser Ile Cys Gly Arg Leu Gln Cys Ile
545 550 555 560

Asn Val Glu Thr Ile Pro Asp Leu Pro Glu His Thr Thr Ile Ile Ser
565 570 575

Thr His Leu Gln Ala Glu Asn Leu Met Cys Trp Gly Thr Gly Tyr His
580 585 590

Leu Ser Met Lys Pro Met Gly Ile Pro Asp Leu Gly Met Ile Asn Asp
595 600 605

Gly Thr Ser Cys Gly Glu Gly Arg Val Cys Phe Lys Lys Asn Cys Val
610 615 620

Asn Ser Ser Val Leu Gln Phe Asp Cys Leu Pro Glu Lys Cys Asn Thr
625 630 635 640

Arg Gly Val Cys Asn Asn Arg Lys Asn Cys His Cys Met Tyr Gly Trp
645 650 655

Ala Pro Pro Phe Cys Glu Glu Val Gly Tyr Gly Gly Ser Ile Asp Ser
660 665 670

Gly Pro Pro Gly Leu Leu Arg Gly Ala Ile Pro Ser Ser Ile Trp Val
675 680 685

Val Ser Ile Ile Met Phe Arg Leu Ile Leu Leu Ile Leu Ser Val Val
690 695 700

Phe Val Phe Phe Arg Gln Val Ile Gly Asn His Leu Lys Pro Lys Gln
705 710 715 720

Glu Lys Met Pro Leu Ser Lys Ala Lys Thr Glu Glu Glu Glu Ser Lys



Chi 



. u S f 



Glu Gl; 



M5 



Ser Glu Ala Lys Thr Gly Gln Glu Glu Ser Lys Ala Asn Ile Glu Ser
755 760 765






<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence:
oligonucleotide

<400> 17

caccLaa<7g: gttcaattct ttg 



<210> 18
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence:
oligonucleotide

<400> 18

caaataotgc aagtgagact tgr 



<210> 19
<211> 24
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence:
oligonucleotide

<400> 19

tgcacaacta cgtgtggtgt accc 24 



<210> 20
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence:

<400> 20

g a q ccactg c a a 1 1 q a a a a a a t g o c 



<210> 21
<211> 21
<212> DNA
<213> Artificial Sequence






<400> 21
■aatgargc 



tc ttgcatggtc g 



<212> DNA
<213> Artificial Sequence

<223> Description of Artificial Sequence:
oligonucleotide

<400> 22

ctttoaegga geccatgtag trgcag 



26 
DNA 



-20 



<223> Description of Artificial Sequence:
oligonucleotide

<400> 23

tgaaggagaa aacgegcaga tgtegg 



26 



<210> 24
<211> 26
<212> DNA
<213> Artificial Sequence

<223> Description of Artificial Sequence: primer

<400> 24

tegataatge atgaaggcaa cccacc 



4 0 0 
■aaTit 



. t. 



Art. iricial 



<223> Description of Artificial Sequence: primer



cac t t ejeaata t t t -jcqco 
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<22 0 - 

<223> Description of Artificial Sequence: primer
<400> 26

gccactgcar. gtatgggtg 19 



<210> 27
<211> 21
<212> DNA
<213> Artificial Sequence

<223> Description of Artificial Sequence: primer
<400> 27

gacaotcttc gctttgggtc g 



<210> 28
<211> 8
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: peptide
fragment

<400> 28

Asp Tyr Lys Asp Asp Asp Asp Lys 



<210> 29
<211> 27
<212> PRT
<213> Artificial Sequence

<223> Description of Artificial Sequence: peptide
fragment

> 1 3 



Val Gln His Leu Gln Ala Ala Phe Ser Gln Tyr
20 25

<223> Description of Artificial Sequence: peptide
fragment

<400> 30

Arg Met Lys Gln Ile Glu Asp Lys Ile Glu Glu Ile Leu Ser Lys Ile
1 5 10 15

Tyr His Ile Glu Asn Glu Ile Ala Arg Ile Lys Lys Leu Ile Gly Glu
20 25 30



Arg 



<210> 31
<211> 12
<212> PRT
<213> Artificial Sequence

<223> Description of Artificial Sequence: peptide
fragment

<220>
<223> "Xaa" at various positions throughout the sequence
may be any amino acid

<400> 31

His Glu Xaa Xaa His Xaa Xaa Gly Xaa Xaa His Asp 
1 5 10 



<211> 9 
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: peptide
fragment

<400> 32

Ser Gln Ser Gln Pro Pro Leu Met Pro



<211> 9
<213> Artificial Sequence

<223> Description of Artificial Sequence: peptide

<400> 33

Gln Glu Glu Ser Lys Xaa Lys Thr Gly



1 



